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5 METALLOPROTEINASE-DISINTEGRIN POLYPEPTIDES 

CROSS REFERENCE TO RELATED APPLICATIONS 

This application is a continuation of U.S. application Serial No. 09/890,323 having a filing 
10 date of December 10, 2001, which is a 35 U.S.C. §371 filing of International Application Number 
PCT/US00/01338 having an international filing date of January 21, 2000 and published under PCT 
Article 21(2) in English on July 27, 2000, said International Application claiming the benefit of United 
States provisional application S.N. 60/116,670; S.N. 60/138,682; and S.N. 60/155,798; filed January 
21, 1999; June 14, 1999; and September 27, 1999, respectively. The entire disclosures of these 
15 applications are relied upon and incorporated by reference herein. 

BACKGROUND OF THE INVENTION 

Field of the Invention 

The invention is directed to purified and isolated, novel SVPH polypeptides (SVPH-1, 
SVPH-la, SVPH- lb, SVPH-lc; SVPH-3; and SVPH-4, SVPH-4a, and SVPH-4b) and fragments 
20 thereof, the nucleic acids encoding such polypeptides, processes for production of recombinant forms 
of such polypeptides, antibodies generated against these polypeptides, fragmented peptides derived 
from these polypeptides, and uses thereof. 



Description of Related Art 

25 Metalloproteinases are a group of proteinases characterized by the presence of a metal 

prosthetic group. Despite this basic similarity, the group, which includes proteinases from snake 
venom, numerous microbial proteinases, and vertebrate and bacterial collagenases, would seem to 
present proteinases of seemingly widely varying activities. For example, snake venom proteases are 
metalloproteinases that affect cell-matrix interactions. Snake venom also includes "disintegrins," a 

30 class of low molecular weight, Arg-Gly-Asp (RGD) -containing, cysteine-rich peptides which bind to 
integrins (a family of molecules involved in cell-to-cell adhesion, cell-to-matrix adhesion, and 
inflammatory responses) expressed on cells. 

Also included are the membrane-anchored ADAMs (A Disintegrin And Metalloproteinase), 
which are multimeric molecules consisting of metalloproteinase, disintegrin-like, cysteine rich, and 

35 epidermal growth factor domains. See, Black et al., "ADAMs: focus on the protease domain," Curr. 
Opin. Cell Biol 10:654-659 (1998); Wolfsberg, T.G. et al., "ADAMs in fertilization and development," 
Dev. Bio. 180:389-401 (1996), all of which are herein incorporated by reference. The 
metalloproteinase-disintegrins or ADAMs have a unique domain structure composed of a signal 
sequence, pro-domain with a Cys switch, catalytic domain with a zinc-binding motif, disintegrin 

40 domain, cysteine-rich domain, a transmembrane domain, and a cytoplasmic domain (Black et al., 
"ADAMs: focus on the protease domain," Curr. Opin. Cell Biol. 10:654-659 (1998) ; Blobel, C. P., 
Cell, 90:589-592 (1997)). Thus, ADAMs are type 1 transmembrane proteins expressed on the cell 
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5 surface. ADAMs have been isolated from mammalian species, Caenorhabditis, Xenopus, and 
Drosophila. Approximately half of the ADAMs do not contain the zinc-binding motif 
HEXXHXXGXXHD (SEQ ID NO:31), which is though to be required for enzymatic activity. 
However, all ADAMs contain the disintegrin domain, which is approximately 80 amino acids in length 
with 15 highly conserved Cys residues. In some members this region has been found to bind integrins 

10 (Almeida, E.A. et al., Cell 81:1095-1104 (1995); Zhang, X. P. et al., /. Biol. Chem. 273:7345-7350 
(1998); Nath, D. et al., /. Cell ScL 112:579-587 (1999)), although the role of this domain for the 
majority of the family members is unknown. 

Over two dozen ADAMs have been identified but only a few have had their biological roles 
elucidated. Tumor necrosis factor-a converting enzyme (TACE/ADAM17) was isolated as the 

15 proteinase required for the shedding of TNF-a from the plasma membrane. See, Blobel, CP, Cell, 
90:589-592 (1997); Moss, M. et al., Nature 385:733-736 (1997); Black, R.A. et al., Nature 385:729- 
733 (1997). More recently TACE/ADAM17 has been found to be required for the ectodomain 
shedding of other cell surface proteins including L-selectin, TGF-a, p80 TNFR, p60TNFR, L-selectin, 
type II IL-1R, and p-amyloid precursor protein (Peschon, J. J. et al., Science 282:1281-1284 (1998)). 

20 Fertilin-a/ADAMl and fertilin-p/ADAM2 are required for sperm-egg fusion (Myles, D. G. et al., Proc. 
Natl Acad. ScL, USA 91:4195-4198 (1994)) while meltrin-o/ADAM12 has a role in muscle cell fusion 
(Yagami-Hiromasa, T. et al., Nature 377:652-656 (1995)). In addition MDC/ADAM11 is a candidate 
tumor suppressor gene (Emi, M. et al., Nat Genet. 5:151-157 (1993)) and Kuz /ADAM10 plays an 
important role in neurogenesis (Pan, D. et al., Cell 90: 271-280 (1997); Rooke, J. et al., Science 

25 273:1227-1231 (1996)). 

Some ADAMs are ubiquitously expressed such as ADAM9, ADAM 10, ADAM15, and 
AD AM 17 and may have pleiotropic effects, as has been found for ADAM 15 and ADAM 17. Many of 
the other ADAMs, however, show tissue- specific expression: ADAM 12 and ADAM 19 in muscle 
(Yagami-Hiromasa, T. et aL, Nature 377:652-656 (1995)), ADAM22 in brain, and ADAM23 in brain 

30 and heart (Sagane, K. et al., J. Biochem. 334:93-98 (1998)). The largest group of ADAMs (Bjarnason, 
J. B. et al., Methods EnzymoL 248: 345-368 (1995); Jia, L. G. et al., Toxicon 34:1269-1276 (1996); 
Stocker, W. et al., Protein ScL 4:823-840 (1995); Black, R. A. et al., Curr. Opin. Cell Biol. 10:654-659 
(1998); Blobel, C. P. , Cell 90:589-592 (1997); Almeida, E. A. et al., Cell 81:1095-1104 (1995); 
Zhang, X. P. et al., J. Biol Chem. 273:7345-7350 (1998); Wolfsberg, T. G. et al., Dev. Biol. 180:389- 

35 401 (1996); Zhu, G. Z. et al., Gene 234:227-237 (1999); Blobel, C. P. et al., Nature 356:248-252 
(1992); Walter, M. A. et al., Nat. Genet. 7:22-28 (1994); Gribskov, M. et al., Nucleic Acids Res. 
14:6745-6763 (1986); Bode, W. et al., FEBS Lett. 331:134-140 (1993); and Cerretti, D. P. et al., 
Cytokine 11:541-551 (1999)) is predominately expressed in testis and is thought to be involved in * 
spermatogenesis and fertilization (Wolfsberg, T. G. et al., Dev. Biol 180:389-401 (1996); Hooft van 

40 Huijsduijnen, R., Gene 206:273-282 (1998); Zhu, G. Z. et al., Gene 234:227-237 (1999)). Indeed, the 
first mammalian ADAMs discovered, ADAM1 and ADAM2, were found to be required for sperm-egg 
fusion (Zhu, G. Z. et al., Gene 234:227-237 (1999)). 
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5 The ADAMs family of metalloproteinase-disintegrins also share homology with the snake 

venom protease family (SVPH). In some snake venom protease members, the disintegrin domain 
prevents platelet aggregation and thus acts as an anti-coagulant. 

Given the significant function of metalloproteinases in membrane and cell-cell fusion, cellular 
adhesion, shedding of membrane proteins, and anti-coagulation, there is a need in the art for additional 
10 metalloproteinases of the ADAMs family and/or the SVPH family, including the discovery, 
identification, and roles of new proteins within these families. 

In another aspect, the identification of the primary structure, or sequence, of an unknown 
protein is the culmination of an arduous process of experimentation. In order to identify an unknown 
protein, the investigator can rely upon a comparison of the unknown protein to known peptides using a 
1 5 variety of techniques known to those skilled in the art. For instance, proteins are routinely analyzed 
using techniques such as electrophoresis, sedimentation, chromatography, sequencing and mass 
spectrometry. 

In particular, comparison of an unknown protein to polypeptides of known molecular weight 
allows a determination of the apparent molecular weight of the unknown protein (Brock, T. D. et al., 

20 Biology of Microorganisms 76-77 (1991)). Protein molecular weight standards are commercially 
available to assist in the estimation of molecular weights of unknown protein (New England Biolabs 
Inc. Catalog:130-131 (1995); J. L. Hartley, U.S. Patent No. 5,449,758). However, the molecular 
weight standards may not correspond closely enough in size to the unknown protein to allow an 
accurate estimation of apparent molecular weight. The difficulty in estimation of molecular weight is 

25 compounded in the case of proteins that are subjected to fragmentation by chemical or enzymatic 
means, modified by post-translational modification or processing, and/or associated with other proteins 
in non-covalent complexes. 

In addition, the unique nature of the composition of a protein with regard to its specific amino 
acid constituents results in unique positioning of cleavage sites within the protein. Specific 

30 fragmentation of a protein by chemical or enzymatic cleavage results in a unique "peptide fingerprint" 
(Cleveland, D. W. et al., /. Biol Chem. 252:1102-1106 (1977); Brown, M. et al., J. Gen. Virol. 50:309- 
316 (1980)). Consequently, cleavage at specific sites results in reproducible fragmentation of a given 
protein into peptides of precise molecular weights. Furthermore, these peptides possess unique charge 
characteristics that determine the isoelectric pH of the peptide. These unique characteristics can be 

35 exploited using a variety of electrophoretic and other techniques (Brock, T. D. et al., Biology of 
Microorganisms 76-77 (Prentice Hall, 6th ed. 1991)). 

Fragmentation of proteins is further employed for amino acid composition analysis and 
protein sequencing (Matsudiara, P., J. Biol. Chem. 262:10035-10038 (1987); Eckerskorn, C. et al., 
Electrophoresis 1988, 9:830-838 (1988)), particularly the production of fragments from proteins with a 

40 "blocked" N- terminus. In addition, fragmented proteins can be used for immunization, for affinity 
selection (R. A. Brown, U.S. Patent No. 5,151,412), for determination of modification sites {e.g. 
phosphorylation), for generation of active biological compounds (Brock, T. D. et al., Biology of 
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5 Microorganisms 300-301 (Prentice Hall, 6th ed. 1991)), and for differentiation of homologous proteins 
(Brown, M. et al., J. Gen. Virol. 50:309-316 (1980)). 

In addition, when a peptide fingerprint of an unknown protein is obtained, it can be compared 
to a database of known proteins to assist in the identification of the unknown protein using mass 
spectrometry (Henzel, W. J. et al., Proc. Natl. Acad. ScL USA 90:5011-5015 (1993); Fenyo, D. et al., 

10 Electrophoresis 19:998-1005 (1998)). A variety of computer software programs to facilitate these 
comparisons are accessible via the Internet, such as Protein Prospector (Internet site: 
prospector.uscf.edu), Multildent (Internet site: www.expasy.ch/sprot/multiident.html), PeptideSearch 
(Internet site: www.mann.embl-heiedelberg.de.. .deSearch/FR_PeptideSearchForm.html), and Pro- 
Found (Internet site: www.chait-sgi.rockefeller.edu/cgi-bin/prot-id-frag.html). These programs allow 

15 the user to specify the cleavage agent and the molecular weights of the fragmented peptides within a 
designated tolerance. The programs compare these molecular weights to protein molecular weight 
information stored in databases to assist in determining the identity of the unknown protein. Accurate 
information concerning the number of fragmented peptides and the precise molecular weight of those 
peptides is required for accurate identification. Therefore, increasing the accuracy in determining the 

20 number of fragmented peptides and their molecular weight should result in enhanced likelihood of 
success in the identification of unknown proteins. 

In addition, peptide digests of unknown proteins can be sequenced using tandem mass 
spectrometry (MS/MS) and the resulting sequence searched against databases (Eng, J. K. et al., J. Am. 
Soc. Mass Spec. 5:976-989 (1994); Mann, M. et al., Anal. Chem. 66:4390-4399 (1994); Taylor, J.A. et 

25 al., Rapid Comm. Mass Spec. 11:1067-1075 (1997)). Searching programs that can be used in this 
process exist on the Internet, such as Lutefisk 97 (Internet site: www.lsbc.com:70/Lutefisk97.html), 
and the Protein Prospector, Peptide Search and ProFound programs described above. Therefore, 
adding the sequence of a gene and its predicted protein sequence and peptide fragments to a sequence 
database can aid in the identification of unknown proteins using tandem mass spectrometry. 

30 Thus, there also exists a need in the art for polypeptides suitable for use in peptide 

fragmentation studies, for use in molecular weight measurements, and for use in protein sequencing 
using tandem mass spectrometry. 

SUMMARY OF THE INVENTION 

35 The invention aids in fulfilling these various needs in the art by providing isolated, novel 

SVPH nucleic acids and polypeptides encoded by these nucleic acids. Particular embodiments of the 
invention are directed to an isolated SVPH nucleic acid molecule comprising the DNA sequence of 
SEQ ID NOs:l-3 and isolated SVPH nucleic acid molecules encoding the amino acid sequence of SEQ 
ED NOs:4-6, as well as nucleic acid molecules complementary to these sequences. Further studies have 

40 revealed the full-length nucleotide sequences of three alternatively spliced SVPH-1 clones (SEQ ID 
NOs:7-9) and two alternatively spliced SVPH 4 clones (SEQ ID NOs:10-ll). Thus, further 
embodiments of the invention are directed to an isolated SVPH nucleic acid molecule comprising the 
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5 DNA sequence of SEQ ID NOs:7-ll and isolated SVPH nucleic acid molecules encoding the amino 
acid sequence of SEQ ID NOs: 12-16, as well as nucleic acid molecules complementary to these 
sequences. Both single-stranded and double-stranded RNA and DNA nucleic acid molecules are 
encompassed by the invention, as well as nucleic acid molecules that hybridize to a denatured, double- 
stranded DNA comprising all or a portion of SEQ ID NOs:l-3 and 7-11. Also encompassed are 
10 isolated nucleic acid molecules that are derived by in vitro mutagenesis of nucleic acid molecules 
comprising sequences of SEQ ID NOs:l-3 and 7-11, that are degenerate from nucleic acid molecules 
comprising sequences of SEQ ID NOs:l-3 and 7-11, and that are allelic variants of DNA of the 
invention. The invention also encompasses recombinant vectors that direct the expression of these 
nucleic acid molecules and host cells stably or transiently transformed or transfected with these vectors. 
15 In addition, the invention encompasses methods of using the nucleic acids noted above to 

identify nucleic acids encoding proteins having metalloproteinase-disintegrin activities; to identify 
human chromosome number 1 or 4; to map genes on human chromosome number lor 4; to identify 
genes associated with certain diseases, syndromes, or other human conditions associated with human 
chromosome number 1 or 4; and to study proteinases and their activities on cell/cell interactions as well 
20 as proteinase activity on the immune system. 

The invention also encompasses the use of sense or antisense oligonucleotides from the 
nucleic acid of SEQ ID NOs:l-3 and 7-11 to inhibit the expression of the polynucleotides encoded by 
the SVPH-1, SVPH-3, or SVPH-4 genes. 

The invention also encompasses isolated polypeptides and fragments thereof encoded by these 
25 nucleic acid molecules including soluble polypeptide portions of SEQ ID Nos:4-6 and 12-16. The 
invention further encompasses methods for the production of these polypeptides, including culturing a 
host cell under conditions promoting expression and recovering the polypeptide from the culture 
medium. Especially, the expression of these polypeptides in bacteria, yeast, plant, insect, and animal 
cells is encompassed by the invention. 
30 In general, the polypeptides of the invention can be used to study the cell/cell and cell/matrix 

interactions involved in cellular processes (including cell fusion as in sperm/egg interactions, cell 
recognition and binding) as well as those involved in the immune system. In addition, these 
polypeptides can be used to identify other proteins associated with SVPH family members, ADAMs 
family members, and other metalloproteinases. 
35 In addition, the invention includes assays utilizing these polypeptides to screen for potential 

inhibitors of activity associated with polypeptide counter-structure molecules, and methods of using 
these polypeptides as therapeutic agents for the treatment of diseases mediated by SVPH polypeptide 
counter-structure molecules. Further, methods of using these polypeptides in the design of inhibitors 
thereof are also an aspect of the invention. 
40 The invention further provides a method for using these polypeptides as molecular weight 

markers that allow the estimation of the molecular weight of a protein or a fragmented protein, as well 
as a method for the visualization of the molecular weight markers of the invention thereof using 
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5 electrophoresis. The invention further encompasses methods for using the polypeptides of the 
invention as markers for determining the isoelectric point of an unknown protein, as well as controls 
for establishing the extent of fragmentation of a protein. Further encompassed by this invention are 
kits to aid in these determinations. 

Isolated polyclonal or monoclonal antibodies that bind to these polypeptides are also 
10 encompassed by the invention, in addition the use of these antibodies to aid in purifying the SVPH 
polypeptide. 

Further encompassed by this invention is the use of the SVPH nucleic acid sequences, 
predicted amino acid sequences of the polypeptide or fragments thereof, or a combination of the 
predicted amino acid sequences of the polypeptide and fragments thereof for use in searching an 
15 electronic database to aid in the identification of sample nucleic acids and/or proteins. 

BRIEF DESCRIPTION OF THE FIGURES 

Figure 1 depicts a Northern blot hybridization showing the tissue specificity of SVPH-1 and 

SVPH-4. 

20 Figure 2 depicts a phylogenetic tree of metalloproteinase-disintegrins. Branches marked with 

heavy lines indicate ADAM family members with a consensus zinc-binding motif 
(HEXXHXXGXXHD) (SEQ ID NO:31). The arrow indicates the probable zinc-binding motif 
containing common ancestor. Lineages in which the zinc-binding site was subsequently lost are 
denoted with an 'X'. Species abbreviations: Mm, Mus musculus; Rn, Rattus norvegicus; Hs, Homo 

25 sapiens; Mf, Macaca fascicularis; Oc, Oryctolagus cuniculus; Cc, Cavia cobaya, Cp, Cavia porcellus; 
So, Saguinus oedipus; Pp, Pongo pygmaeus; Bt, Bos taurus. 



DETAILED DESCRIPTION OF THE INVENTION 
The nucleic acid molecules encompassed in the invention include the following nucleotide 
30 sequences: 

Name: SVPH-1 

1 ATTTTTGATA CCACAGTGAC CAACACGGTC ACCTAAGGTG TTCAATTCTT 
51 TGTAGCAAGT CTCACTTGCA GTATTTGCGC CTGCACCAAA AATCCTCCTA 
101 CACTGTTCAN TTGCGGTCAT GACANGCTC (SEQ ID NO: 1) 

35 

Name: SVPH-3 



1 TTTTTGAGTA AGAATAGGTC ATGTTTTAGT AAAACTTCCA AAAGAACAAA 

40 51 ACAGATTCTT CAACCCAGGA GGACATGTGA GTCACAATAC CCTTTAATCC 

101 ACAGGTTGGC TC C TTGGTTT CTGGAACTTT CTGCCTCCTG TAAACGATGT 

151 GCGGGTGGTA CCCTCCCTCA ACCAGTGGAT GCTTCTTCAC GGGTTCAATG 

201 AAAAAGTCTC CATGTGGTAG TTGGAAAAAT CCAGTCAGTC CATGGCAGGC 

2 51 ACTGAGGGCT GCCGTCCCAA CTCTGGTGCC CTGCTGTAGA ACCGTGCCAC 

45 301 TGAGATGGCA GAGGGGGGCA G AGGAAGC C A TCATCTTAAC ATGGGAGAGG 

351 TTCCCATATC TCTTCTCCAT GATGTAGCTA TTGGAAAGAA ATCCTTCATT 

401 GACCGTCAAG TTAAAAAACA GGTCCTTCTC C TC GTGAGAA ATTCTGTAGT 

451 ACACCCAGTC CTCTGAGCC (SEQ ID NO:2) 
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5 Name: SVPH-4 

1 CACGAGGATT TATATCTTCA AAGAAAATAT AATGATGCTC TTGCATGGTC 

51 GTTTGGAAAA GTGTGTTCTC TAGAATATGC TGGATCAGTG AGTACTTTAC 
101 TAGATACAAA TATCCTTGCC CCTGCTACCT GGTCTGCTCA TGAGC TGGGT 

10 151 C ATGC TGTAG GAATGTCACA TGATGAACAA TACTGCCAAT GTAGGGGTAG 

2 01 GCCTAATTGC ATCATGGGCT CAGGACGCAC TGGGTTTAGC AATTGCAGTT 

2 51 ATATCTCTTT TTTTAAACAT ATCTCTTCGG GAGCAACATG TCTAAATAAT 

3 01 ATCCCAGGAC TAGGTTATGT GCTTAAGAGA TGTGGAAACA AAATTGTGGA 
3 51 GGACAATGAG GAATGTGATT GTGGTTCCAC AGAGGAGTGT CAGAAAGATC 

15 401 GGTGTTGC C A ATCAAATTGT AAGTTGCAAC CAGGTGCCAA C TGTAGC ATT 

451 GGAC TTTGCT GTCATGATTG TCGGTTTCGT CCATCTGGAT AC GTGTGTAG 
501 GCAGGAAGGA AATGAATGTG ACCTTGCAGA GTACTGCGAC GGGAATTCAA 
551 GTTCCTGCCC AAATGAC GTT TATAAGCAGG ATGGAACCCC TTGCAAGTAT 
601 GAAGGCCGTT GTTTCAGGAA GGGGTGCAGA TCCAGATATA TGCAGTGCCA 

20 651 AAGCATTTTT GGAC C TGATG CCATGGAGGC TCCTAGTGAG TGCTATGATG 

7 01 CAGTTAACTT AATAGGTGAT CAATTTGGTA AC TGTGAG AT TACAGGAATT 
751 C GAAATTTT A AAAAGTGTGA AAGTGCAAAT TCAATATGTG GCAGGCTACA 
801 GTGTATAAAT GTTGAAAC C A TCCCTGATTT GCCAGAGCAT AC GACTATAA 
851 TTTCTACTCA TTTACAGGCA GAAAATC TC A TGTGC TGGGG CACAGGCTAT 

25 901 CATCTATCCA TGAAACCCAT GGGAATACCT GACCTAGGTA TGATAAATGA 

951 TGGCACCTCC TGTGGAGAAG GCCGGGTATG TTTTAAAAAA AATTGCGTCA 
1001 ATAGCTCAGT CCTGCAGTTT G AC TGTTTGC CTGAGAAATG CAATACCCGG 
1051 GGTGTTTGCA ACAACAGAAA AAACTGCCAC TGCATGTATG GGTGGGC AC C 
1101 TCCATTCTGT GAGGAAGTGG GGTATGGAGG AAGCATTGAC AGTGGGCCTC 

30 1151 CAGGACTGCT C AG AGGGGC G ATTCCCTCGT CAATTTGGGT TGTGTCCATC 

12 01 ATAATGTTTC GCCTTATTTT ATTAATCCTT TCAGTGGTTT TTGTGTTTTT 
1251 CCGGCAAGTG ATAGGAAACC ACTTAAAACC CAAACAGGAA AAAATGCCAC 

13 01 TATCCAAAGC AAAAACTGAA CAGGAAGAAT CTAAAACAAA AACTGTACAG 
1351 GAAGAATCTA AAACAAAAAC TGGACAGGAA GAATC TGAAG CAAAAACTGG 

35 14 01 ACAGGAAGAA TCTAAAGCAA AAAC TGG AC A GGAAGAATCT AAAGCAAACA 
1451 TTGAAAGTAA ACGACCCAAA GCAAAGAGTG TCAAGAAACA AAAAAAGTAA 
(SEQ ID NO: 3) 



40 



Name: SVPH-la 



1 ATGAAGATGT TACTCCTGCT GCATTGCCTT GGGGTGTTTC TGTCCTGTTC 

51 TGGACACATC CAGGATGAGC ACCCCCAATA TCACAGCCCT CCGGATGTGG 

101 TGATTCCTGT GAGGATAACT GGCACCACCA GAGGCATGAC ACCTCCAGGC 

151 TGGCTCTCCT ATATCCTGCC CTTTGGAGGC CAGAAACACA TTATC C AC AT 

45 2 01 AAAGGTCAAG AAGC TTTTGT TTTCCAAACA CCTCCCTGTG TTCACCTACA 

251 CAGACCAGGG TGCTATCCTT GAGGAC C AGC C ATTTGTC C A GAATAAC TGC 

3 01 TACTATCATG GTTATGTGGA AGGGGACCCA GAATC C C TGG TTTCCCTCAG 

351 TACCTGTTTT GGGGGTTTTC AAGGAATATT ACAGATAAAT GACTTTGCTT 

401 ATGAAATCAA GCCCCTAGCA TTTTCTACCA CGTTTGAACA TCTGGTATAC 

50 451 AAGATGGACA GTGAGGAGAA ACAATTTTCA ACCATGAGAT CCGGATTTAT 

501 GCAAAATGAA ATAACATGCC GAATGGAATT TGAAGAAATT GATAATTCCA 

551 CTCAGAAGCA AAGTTC TTAT GTGGGCTGGT GGATCCATTT TAGGATTGTT 

601 GAAATTGTAG TCGTCATTGA TAATTATCTG TACATTCGTT ATGAAAGGAA 

651 CGACTCAAAG TTGC TGGAGG ATCTATATGT TATTGTTAAT ATAGTGGATT 

55 7 01 CCATTTTGGA TGTCATTGGT GTTAAGGTGT TATTATTTGG TTTGGAGATC 

751 TGGACCAATA AAAACCTCAT TGTAGTAGAT GATGTAAGGA AATC TGTGC A 

8 01 CCTGTATTGC AAGTGGAAGT CGGAGAACAT TACGCCCCGG ATGCAACATG 

851 ACACCTCACA TCTTTTCACA ACTCTAGGAT TAAGAGGGTT AAGTGGCATA 

901 GGAGCTTTTA GAGGAATGTG TACACCACAC CGTAGTTGTG CAATTGTTAC 

60 951 TTTCATGAAC AAAACTTTGG GC AC TTTTTC AATTGCAGTG GCTCATCATC 

1001 TAGGTCATAA TTTGGGCATG AACCATGATG AGGATACATG TCGTTGTTCA 

1051 CAACCTAGAT GCATAATGCA TGAAGGCAAC CCACCAATAA C TAAATTT AG 

1101 CAATTGTAGT TATGGTGATT TTTGGGAATA TACTGTAGAG AGGACAAAGT 

1151 GTTTGC TTGA AACAGTACAC ACAAAGGACA TC TTTAATGT GAAGCGCTGT 

65 12 01 GGGAATGGTG TTGTTGAAGA AGGAGAAGAG TGTGACTGTG GACCTTTAAA 

1251 GCATTGTGCA AAAGATC C CT GCTGTCTGTC AAATTGCACT CTGAC TGATG 

13 01 GTTCTACTTG TGC TTTTGGG C TTTGTTGC A AAG AC TGC AA GTTCCTACCA 

1351 TCAGGGAAAG TGTGTAGAAA GGAGGTCAAT GAATGTGATC TTCCAGAGTG 

1401 GTGCAATGGT ACTTCCCATA AGTGCCCAGA TGACTTTTAT GTGGAAGATG 



7 



Metalloproteinase-Disintegrin Polypeptides 2517-USB 



5 1451 GAATTCCCTG TAAGGAGAGG GGCTACTGCT ATGAAAAGAG CTGTCATGAC 

1501 CGCAATGAAC AGTGTAGGAG GATTTTTGGT GCAGGCGCAA ATACTGCAAG 

1551 TGAGACTTGC TACAAAGAAT TGAAC AC CTT AGGTGACCGT GTTGGTCACT 

16 01 GTGGTATCAA AAATGCTACA TATATAAAGT GTAATATCTC AGATGTCCAG 
1651 TGTGGAAGAA TTCAGTGTGA GAATGTGACA GAAATTCCCA ATATGAGTGA 

10 17 01 TC ATAC TACT GTGCATTGGG CTCGCTTCAA TGACATAATG TGC TGGAGTA 

17 51 C TGATTACC A TTTGGGGATG AAGGGACCTG ATATTGGTGA AGTGAAAGAT 
1801 GGAACAGAGT GTGGGATAGA TCATATATGC ATCCACAGGC AC TGTGTC C A 
1851 T AT AAC CATC TTGAATAGTA ATTGCTCACC TGCATTTTGT AACAAGAGGG 
19 01 GCATCTGCAA CAATAAACAT CACTGCCATT GCAATTATCT GTGGGACCCT 

15 1951 CCCAACTGCC TGATAAAAGG CTATGGAGGT AGTGTTGACA GTGGCCCACC 

2 001 CCCTAAGAGA AAGAAGAAAA AGAAGTTCTG TTATCTGTGT ATATTGTTGC 

2051 TTATTGTTTT GTTTATTTTA TTATGTTGTC TTTATC GACT TTGTAAAAAA 

2101 AGTAAAC C AA TAAAAAAGCA GCAAGATGTT CAAACTCCAT CTGCAAAAGA 

2151 AGAGGAAAAA ATTCAGCGTC GACCTCATGA GTTACCTCCC CAGAGTCAAC 

20 22 01 C TTGGGTG AT GCCTTCCCAG AGTCAACCTC CTGTGACACC CTCCCAGAGG 

22 51 CAACCTCAGT TGATGCCTTC CCAGAGTCAA CCTCCTGTGA CGCCCTCCTA 

23 01 G (SEQIDNO:7) 
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Name: SVPH-lb 



1 ATGAAGATGT TACTCCTGCT GCATTGCCTT GGGGTGTTTC TGTC C TGTTC 

51 TGGACACATC CAGGATGAGC ACCCCCAATA TCACAGCCCT CCGGATGTGG 

101 TGATTCCTGT GAGGATAACT GGCACCACCA GAGGCATGAC ACCTCCAGGC 

151 TGGCTCTCCT ATATCCTGCC CTTTGGAGGC CAGAAACACA TTATCCACAT 

30 2 01 AAAGGTCAAG AAGCTTTTGT TTTCCAAACA CCTCCCTGTG TTCACCTACA 

2 51 CAGACCAGGG TGCTATCCTT GAGGACCAGC CATTTGTCCA GAATAACTGC 

3 01 TACTATCATG GTTATGTGGA AGGGGACCCA GAATCCCTGG TTTCCCTCAG 
351 TACCTGTTTT GGGGGTTTTC AAGGAATATT ACAGATAAAT GACTTTGCTT 
401 ATGAAATCAA GCCCCTAGCA TTTTCTACCA C GTTTGAAC A TC TGGT AT AC 

35 451 AAGATGGACA GTGAGGAGAA ACAATTTTCA ACCATGAGAT CCGGATTTAT 

5 01 GCAAAATGAA ATAACATGCC GAATGGAATT TGAAGAAATT GATAATTCCA 

551 CTCAGAAGCA AAGTTC TT AT GTGGGCTGGT GGATCCATTT TAGGATTGTT 

601 GAAATTGTAG TCGTCATTGA TAATTATCTG TACATTCGTT ATGAAAGGAA 

651 CGACTCAAAG TTGC TGGAGG ATCTATATGT TATTGTTAAT ATAGTGGATT 

40 7 01 CCATTTTGGA TGTCATTGGT GTTAAGGTGT TATTATTTGG TTTGGAGATC 

751 TGGACCAATA AAAACCTCAT TGTAGTAGAT GATGTAAGGA AATC TGTGC A 

801 CCTGTATTGC AAGTGGAAGT C GGAGAAC AT TACGCCCCGG ATGCAACATG 

851 ACACCTCACA TC TTTTC AC A ACTCTAGGAT TAAGAGGGTT AAGTGGCATA 

901 GGAGCTTTTA GAGGAATGTG TACACCACAC CGTAGTTGTG CAATTGTTAC 

45 951 TTTCATGAAC AAAAC TTTGG GCACTTTTTC AATTGCAGTG GCTCATCATC 

1001 TAGGTCATAA TTTGGGCATG AACCATGATG AGGATACATG TCGTTGTTCA 

1051 CAACCTAGAT GCATAATGCA TGAAGGCAAC CCACCAATAA CTAAATTTAG 

1101 CAATTGTAGT TATGGTGATT TTTGGGAATA TACTGTAGAG AGGACAAAGT 

1151 GTTTGCTTGA AACAGTACAC ACAAAGGACA TCTTTAATGT GAAGCGCTGT 

50 12 01 GGGAATGGTG TTGTTGAAGA AGGAGAAGAG TGTGACTGTG G AC C TTT AAA 

1251 GCATTGTGCA AAAGATCCCT GCTGTCTGTC AAATTGC AC T CTGACTGATG 

13 01 GTTCTACTTG TGCTTTTGGG CTTTGTTGCA AAGACTGCAA GTTCCTACCA 

1351 TCAGGGAAAG TGTGTAGAAA GGAGGTCAAT GAATGTGATC TTCCAGAGTG 

1401 GTGCAATGGT ACTTCCCATA AGTGCCCAGA TGACTTTTAT GTGGAAGATG 

55 1451 GAATTCCCTG TAAGGAGAGG GGCTACTGCT ATGAAAAGAG CTGTCATGAC 

1501 CGCAATGAAC AGTGTAGGAG GATTTTTGGT GCAGGCGCAA ATACTGCAAG 

1551 TGAGACTTGC TACAAAGAAT TGAAC ACCTT AGGTGACCGT GTTGGTCACT 

1601 GTGGTATCAA AAATGCTACA TATATAAAGT GTAATATCTC AGATGTCCAG 

1651 TGTGGAAGAA TTCAGTGTGA GAATGTGACA GAAATTCCCA ATATGAGTGA 

60 1701 TC AT AC TACT GTGCATTGGG CTCGCTTCAA TGACATAATG TGCTGGAGTA 

1751 C TGATTAC C A TTTGGGGATG AAGGGACCTG ATATTGGTGA AGTGAAAGAT 

1801 GGAACAGAGT GTGGGATAGA TCATATATGC ATCCACAGGC ACTGTGTC C A 

1851 TATAACCATC TTGAATAGTA ATTGCTCACC TGCATTTTGT AACAAGAGGG 

1901 GCATCTGCAA CAATAAACAT CACTGCCATT GCAATTATCT GTGGGACCCT 

65 1951 CCCAACTGCC TGATAAAAGG CTATGGAGGT AGTGTTGACA GTGGTCCACC 

2 001 CCCTAAGAGA AAGAAGAAAA AGAAGTTCTG TTATCTGTGT ATATTGTTGC 

2 051 TTATTGTTTT GTTTATTTTA TTATGTTGTC TTTATC GACT TTGTAAAAAA 

2101 AGTAAACCAA TAAAAAAGCA GCAAGATGTT CAAACTCCAT CTGCAAAAGA 

2151 AGAGGAAAAA ATTCAGCGTC GACCTCATGA GTTACCTCCC CAGAGTCAAC 
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5 2201 CTTGGGTGAT GCCTTCCCAG AGTCAACCTC CTGTGACGCC TTCCCAGAGT 

22 51 CATCCTCAGG TGATGC C TTC CCAGAGTCAA CCTCCTCAAA ATTTATTCCT 

2301 GTTCAGCTTC TCAATCAGTG ACTGTGTGCT AAATTTTAGG CTACTGTATC 

2351 TTCAGGCCAC CTGA (SEQ ID NO:8) 

10 Name: SVPH-lc 

1 ATGAAGATGT TACTCCTGCT GCATTGCCTT GGGGTGTTTC TGTCCTGTTC 

51 TGGACACATC CAGGATGAGC ACCCCCAATA TCACAGCCCT CCGGATGTGG 

101 TGATTCCTGT GAGGATAACT GGCACCACCA GAGGCATGAC ACCTCCAGGC 

15 151 TGGCTCTCCT ATATCCTGCC CTTTGGAGGC CAGAAACACA TTATCCACAT 

2 01 AAAGGTCAAG AAGC TTTTGT TTTCCAAACA CCTCCCTGTG TTCACCTACA 
251 CAGACCAGGG TGCTATCCTT GAGGACCAGC C ATTTGTC C A GAATAACTGC 

3 01 TACTATCATG GTTATGTGGA AGGGGACCCA GAATCCCTGG TTTCCCTCAG 
351 TACCTGTTTT GGGGGTTTTC AAGGAATATT ACAGATAAAT GACTTTGCTT 

20 401 ATGAAATCAA GCCCCTAGCA TTTTCTACCA CGTTTGAACA TCTGGTATAC 

451 AAGATGGACA GTGAGGAGAA ACAATTTTCA ACCATGAGAT CCGGATTTAT 

501 GCAAAATGAA ATAACATGCC GAATGGAATT TGAAGAAATT GATAATTCCA 

551 CTCAGAAGCA AAGTTC TT AT GTGGGC TGGT GGATCCATTT TAGGATTGTT 

601 GAAATTGTAG TCGTCATTGA TAATTATC TG TACATTCGTT ATGAAAGGAA 

25 651 CGACTCAAAG TTGCTGGAGG ATCTATATGT TATTGTTAAT ATAGTGGATT 

7 01 CCATTTTGGA TGTCATTGGT GTTAAGGTGT TATTATTTGG TTTGGAGATC 

7 51 TGGACCAATA AAAACCTCAT TGTAGTAGAT GATGTAAGGA AATCTGTGCA 

801 CCTGTATTGC AAGTGGAAGT CGGAGAACAT TACGCCCCGG ATGCAACATG 

851 ACACCTCACA TCTTTTCACA ACTCTAGGAT TAAGAGGGTT AAGTGGCATA 

30 901 GGAGC TTTTA GAGGAATGTG TACACCACAC CGTAGTTGTG CAATTGTTAC 

951 TTTCATGAAC AAAAC TTTGG GCACTTTTTC AATTGCAGTG GCTCATCATC 

1001 TAGGTCATAA TTTGGGCATG AACCATGATG AGGATACATG TCGTTGTTCA 

1051 CAACCTAGAT GCATAATGCA TGAAGGCAAC CCACCAATAA C T AAATTT AG 

1101 CAATTGTAGT TATGGTGATT TTTGGGAATA TACTGTAGAG AGGACAAAGT 

35 1151 GTTTGC TTGA AACAGTACAC ACAAAGGACA TCTTTAATGT GAAGCGCTGT 

12 01 GGGAATGGTG TTGTTGAAGA AGGAGAAGAG TGTG AC TGTG GAC CTTTAAA 
1251 GCATTGTGCA AAAGATCCC T GCTGTCTGTC AAATTGC AC T CTGACTGATG 

13 01 GTTCTACTTG TGC TTTTGGG CTTTGTTGCA AAGACTGCAA GTTCCTACCA 

13 51 TCAGGGAAAG TGTGTAGAAA GGAGGTCAAT GAATGTGATC TTCCAGAGTG 
40 1401 GTGCAATGGT ACTTCCCATA AGTGCCCAGA TGAC TTTTAT GTGGAAGATG 

14 51 GAATTCCCTG TAAGGAGAGG GGCTACTGCT ATGAAAAGAG CTGTCATGAC 
1501 CGCAATGAAC AGTGTAGGAG GATTTTTGGT GCAGGCGCAA ATACTGCAAG 
1551 TGAGACTTGC TACAAAGAAT TGAACACCTT AGGTGACCGT GTTGGTC AC T 
16 01 GTGGTATCAA AAATGC T AC A TATATAAAGT GTAATATCTC AGATGTC C AG 

45 1651 TGTGGAAGAA TTCAGTGTGA GAATGTGACA GAAATTCCCA ATATGAGTGA 

1701 TC AT ACT AC T GTGCATTGGG CTCGCTTCAA TGACATAATG TGCTGGAGTA 

1751 CTGATTACCA TTTGGGGATG AAGGGACCTG ATATTGGTGA AGTGAAAGAT 

1801 GGAACAGAGT GTGGGATAGA TCATATATGC ATCCACAGGC ACTGTGTCCA 

1851 TAT AAC CATC TTGAATAGTA ATTGCTCACC TGCATTTTGT AACAAGAGGG 

50 1901 GCATCTGCAA CAATAAACAT C AC TGC C ATT GCAATTATCT GTGGGAC CCT 

1951 CCCAACTGCC TGATAAAAGG CTATGGAGGT AGTGTTGACA GTGGCCCACC 

2001 CCCTAAGAGA AAGAAGAAAA AGAAGTTCTG TTATC TGTGT ATATTGTTGC 

2051 TTATTGTTTT GTTTATTTTA TTATGTTGTC TTTATCGACT TTGTAAAAAA 

2101 AGTAAACCAA TAAAAAAGCA GCAAGATGTT CAAACTCCAT CTGCAAAAGA 

55 2151 AGAGGAAAAA ATTCAGCGTC GACCTCATGA GTTACCTCCC CAGAGTCAAC 

22 01 CTTGGGTGAT GCCTTCCCAG AGTCAACCTC CTGTGACGCC TTCCCAGAGT 
2251 CATCCTCGGG TGATGCCTTC TCAGAGTCAA CCTCCTGTGA TGCCTTCCCA 

23 01 GAGTCATCCT CAGTTGACGC CTTCCCAGAG TCAACCTCCT GTGATGCCTT 
2351 CCCAGAGTCA TCCTCAGTTG ACGCCTTCCC AGAGTCAACC TCCTGTGACA 

60 2401 CCCTCCCAGA GGCAACCTCA GTTGATGCCT TCCCAGAGTC AACCTCCTGT 

2451 GACGCCCTCC TAG (SEQ ID NO: 9) 

Name: SVPH-4a 

65 1 ATGAGGTCAG TGCAGATCTT CCTCTCCCAA TGCCGTTTGC TCCTTCTACT 

51 AGTTCCGACA ATGCTCCTTA AGTCTCTTGG CGAAGATGTA ATTTTTCACC 

101 CTGAAGGGGA GTTTGACTCG TATGAAGTCA CCATTCCTGA GAAGCTGAGC 

151 TTCCGGGGAG AGGTGCAGGG TGTGGTCAGT CCCGTGTCCT ACCTACTGCA 
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2 01 GTTAAAAGGC 
251 TGCCCCGACA 
301 GAGGATCATC 

3 51 AGAGTCTCTG 
401 GAGGTGTATT 
451 GCCTCTCCCA 
501 TGGGAATCAG 
551 CCCCTTATGA 
601 CACCCAAAGT 
651 GTTTGTGAAC 
7 01 CTGGGATTAT 
751 AAGGCTCTTG 
801 AGAGTTAGCT 
851 TAAATGCTC G 
901 T ATAATGATG 
951 TGCTGGATC A 

1001 CCTGGTCTGC 

1051 CAATACTGCC 

1101 C AC TGGGTTT 

1151 CGGGAGCAAC 

12 01 AGATGTGGAA 

12 51 CACAGAGGAG 

13 01 AACCAGGTGC 
1351 CGTCCATCTG 

14 01 AGAGTACTGC 
1451 AGGATGGAAC 
1501 AGATCCAGAT 
1551 GGCTCCTAGT 
1601 GTAACTGTGA 
1651 AATTCAATAT 
1701 TTTGCCAGAG 
1751 TCATGTGCTG 
1801 CCTGACCTAG 
1851 ATGTTTT AAA 
1901 TGCCTGAGAA 
1951 C ACTGC ATGT 
2 001 AGGAAGC ATT 
2051 CGTCAATTTG 
2101 CTTTCAGTGG 
2151 ACCCAAACAG 
2201 AATCTAAAAC 

22 51 GAAGAATCTG 

23 01 AC AGGAAGAA 
2351 GTGTCAAGAA 

Name: SVPH-4b 

1 ATGAGGTCAG 

51 AGTTCCGACA 

101 CTGAAGGGGA 

151 TTCCGGGGAG 

2 01 GTTAAAAGGC 

2 51 TGCCCCGACA 

3 01 GAGGATCATC 
351 AGAGTCTCTG 
401 GAGGTGTATT 
451 GCCTCTCCCA 
501 TGGGAATCAG 
551 CCCCTTATGA 
601 CACCCAAAGT 
651 GTTTGTGAAC 
7 01 CTGGGATTAT 
7 51 AAGGCTCTTG 
801 AGAGTTAGCT 
851 TAAATGCTC G 



AAGAAGCACG 
TCTGCGCGTT 
CTTACATACC 
GACTCTAAAG 
TAACATTGAT 
GTTTTGAACA 
GTTTGTGGCT 
G AATAAGGC G 
ACTTGGAATT 
AACAATCTTT 
GGACACCTAC 
AAGTATGGAC 
GAAGTTTTAG 
CCTGTCATCA 
CTCTTGCATG 
GTGAGTACTT 
TCATGAGCTG 
AATGTAGGGG 
AGCAATTGCA 
ATGTCTAAAT 
ACAAAATTGT 
TGTCAGAAAG 
CAACTGTAGC 
GATACGTGTG 
GACGGGAATT 
CCCTTGCAAG 
ATATGCAGTG 
GAGTGCTATG 
GATTACAGGA 
GTGGCAGGCT 
CATACGACTA 
GGGCACAGGC 
GTATGATAAA 
AAAAATTGCG 
ATGCAATACC 
ATGGGTGGGC 
GACAGTGGGC 
GGTTGTGTCC 
TTTTTGTGTT 
GAAAAAATGC 
AAAAACTGTA 
AAGCAAAAAC 
TCTAAAGCAA 
ACAAAAAAAG 



TCCTCCATTT 
TTCTCCTTCA 
AAAGGACTGC 
CTACTATAAG 
GCCAAACATT 
TGTCGTCTAT 
TAAGTGATGA 
AGGCTAAGGG 
GATCCTACTC 
CTCAAGTCAT 
TTTCAAGATG 
AGATTTTAAC 
GCAGATTTGT 
GATTGGGCAC 
GTCGTTTGGA 
TACTAGATAC 
GGTCATGCTG 
TAGGCCTAAT 
GTTATATCTC 
AATATCCCAG 
GGAGGACAAT 
ATCGGTGTTG 
ATTGGACTTT 
TAGGCAGGAA 
CAAGTTCCTG 
TATGAAGGCC 
CCAAAGCATT 
ATGCAGTTAA 
ATTCGAAATT 
ACAGTGTATA 
TAATTTCTAC 
TATCATCTAT 
TGATGGCACC 
TCAATAGCTC 
CGGGGTGTTT 
ACCTCCATTC 
CTCCAGGACT 
ATCATAATGT 
TTTCCGGCAA 
CACTATCCAA 
CAGGAAGAAT 
TGGACAGGAA 
ACATTGAAAG 
TAA (SEQID 



GTGGCCCAAG 
CAGAACATGG 
AACTACATGG 
CACATGCATG 
ACCAAATTGA 
CTCC TGAAGA 
TGAAATAGAA 
ACTTTCCTGG 
TTTGATCAAA 
ACATGATGCC 
TTC GTATGAG 
AAAATACGCG 
AATATATAAA 
ATTTATATCT 
AAAGTGTGTT 
AAATATCCTT 
TAGGAATGTC 
TGCATCATGG 
TTTTTTTAAA 
G AC T AGGTTA 
GAGGAATGTG 
CCAATCAAAT 
GCTGTCATGA 
GGAAATGAAT 
CCCAAATGAC 
GTTGTTTCAG 
TTTGGACCTG 
CTTAATAGGT 
TTAAAAAGTG 
AATGTTGAAA 
TCATTTACAG 
CCATGAAACC 
TCCTGTGGAG 
AGTCCTGCAG 
GCAACAACAG 
TGTGAGGAAG 
GCTCAGAGGG 
TTCGCCTTAT 
GTGATAGGAA 
AGCAAAAACT 
CTAAAACAAA 
GAATC TAAAG 
TAAACGACCC 
TO: 10) 



AGACTTC TGT 
GGAACTGCTG 
GCTCCGTGAA 
GGGGGTCTCC 
GCCCCTCAAG 
AAGAGCAGTT 
TGGCAGATGG 
ATCCTATAAA 
GTAGGTATAG 
ATTCTTTTGA 
GATACACTTA 
TTGGATATCC 
AAAAGTGTAT 
TCAAAGAAAA 
CTCTAGAATA 
GCCCCTGCTA 
ACATGATGAA 
GCTCAGGACG 
CATATCTCTT 
TGTGCTTAAG 
ATTGTGGTTC 
TGTAAGTTGC 
TTGTCGGTTT 
GTGAC CTTGC 
GTTTATAAGC 
GAAGGGGTGC 
ATGCCATGGA 
GATCAATTTG 
TGAAAGTGCA 
CCATCCCTGA 
GCAGAAAATC 
CATGGGAATA 
AAGGCCGGGT 
TTTGACTGTT 
AAAAAACTGC 
TGGGGTATGG 
GCGATTCCCT 
TTTATTAATC 
ACCACTTAAA 
GAACAGGAAG 
AAC TGGAC AG 
CAAAAACTGG 
AAAGCAAAGA 



TGCAGATCTT CCTCTCCCAA 
ATGCTCCTTA AGTCTCTTGG 
GTTTGACTCG TATGAAGTCA 
AGGTGCAGGG TGTGGTCAGT 
AAGAAGCACG TCCTCCATTT 
TCTGCGCGTT TTCTCCTTCA 
CTTACATACC AAAGGACTGC 
GACTCTAAAG CTACTATAAG 
TAACATTGAT GCCAAACATT 
GTTTTGAACA TGTCGTCTAT 
GTTTGTGGCT TAAGTGATGA 
GAATAAGGCG AGGCTAAGGG 
ACTTGGAATT GATCCTACTC 
AACAATCTTT CTCAAGTCAT 
GGACACCTAC TTTCAAGATG 
AAGTATGGAC AGATTTTAAC 
GAAGTTTTAG GCAGATTTGT 
CCTGTCATCA GATTGGGCAC 



TGCCGTTTGC TCCTTCTACT 
CGAAGATGTA ATTTTTCACC 
CCATTCCTGA GAAGCTGAGC 
CCCGTGTCCT ACCTACTGCA 
GTGGCCCAAG AGACTTCTGT 
CAGAACATGG GGAACTGCTG 
AACTACATGG GCTCCGTGAA 
CACATGCATG GGGGGTCTCC 
ACCAAATTGA GCCCCTCAAG 
CTCCTGAAGA AAGAGCAGTT 
TGAAATAGAA TGGCAGATGG 
ACTTTCCTGG ATCCTATAAA 
TTTGATCAAA GTAGGTATAG 
ACATGATGCC ATTCTTTTGA 
TTCGTATGAG GATACACTTA 
AAAATACGCG TTGGATATCC 
AATATATAAA AAAAGTGTAT 
ATTTATATCT TCAAAGAAAA 
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2517-USB 



901 TATAATGATG CTCTTGCATG GTCGTTTGGA AAAGTGTGTT CTCTAGAATA 

951 TGCTGGATCA GTGAGTACTT T AC TAG AT AC AAATATCCTT GCCCCTGCTA 

1001 CCTGGCCTGC TCATGAGCTG GGTCATGCTG TAGGAATGTC ACATGATGAA 

1051 CAATACTGCC AATGTAGGGG TAGGCTTAAT TGCATCATGG GCTCAGGACG 

1101 CACTGGGTTT AGCAATTGCA GTTATATCTC TTTTTTTAAA CATATCTCTT 

1151 CGGGAGCAAC ATGTCTAAAT AATATCCCAG G AC T AGGTTA TGTGC TTAAG 

12 01 AGATGTGGAA ACAAAATTGT GGAGGACAAT GAGGAATGTG AC TGTGGTTC 

12 51 CACAGAGGAG TGTCAGAAAG ATC GGTGTTG CCAATCAAAT TGTAAGTTGC 

13 01 AACCAGGTGC CAACTGTAGC ATTGG AC TTT GCTGTCATGA TTGTC GGTTT 
13 51 CGTCCATCTG GATAC GTGTG TAGGCAGGAA GGAAATGAAT GTGACCTTGC 
1401 AG AGT AC TGC GACGGGAATT CAAGTTCCTG CCCAAATGAC GTTTATAAGC 
1451 AGGATGGAAC CCCTTGCAAG TATGAAGGCC GTTGTTTCAG GAAGGGGTGC 
1501 AGATCCAGAT ATATGCAGTG CCAAAGCATT TTTGGACCTG ATGCCATGGA 
1551 GGCTCCTAGT GAGTGCTATG ATGCAGTTAA CTTAATAGGT GATCAATTTG 

16 01 GTAACTGTGA GATTACAGGA ATTCGAAATT TTAAAAAGTG TGAAAGTGCA 
1651 AATTCAATAT GTGGCAGGCT ACAGTGTATA AATGTTGAAA CCATCCCTGA 

17 01 TTTGC C AGAG CATACGACTA T AATTTC T AC TCATTTACAG GCAGAAAATC 

17 51 TCATGTGCTG GGGCACAGGC TATCATCTAT CCATGAAACC CATGGGAATA 

18 01 CCTGACCTAG GTATGATAAA TGATGGC AC C TCCTGTGGAG AAGGCCGGGT 
1851 ATGTTTTAAA AAAAATTGCG TCAATAGCTC AGTCCTGCAG TTTGACTGTT 
1901 TGCCTGAGAA ATGCAATACC CGGGGTGTTT GCAACAACAG AAAAAACTGC 
1951 CACTGCATGT ATGGGTGGGC ACCTCCATTC TGTGAGGAAG TGGGGTATGG 
2001 AGGAAGCATT GACAGTGGGC CTCCAGGACT GCTCAGAGGG GCGATTCCCT 
2051 CGTCAATTTG GGTTGTGTCC ATCATAATGT TTCGCCTTAT TTTATTAATC 
2101 CTTTCAGTGG TTTTTGTGTT TTTCCGGCAA GTGATAGGAA AC C AC TT AAA 
2151 AC C C AAAC AG GAAAAAATGC CACTATCCAA AGCAAAAACT GAACAGGAAG 
22 01 AATC TAAAAC AAAAAC TGTA CAGGAAGAAT CTAAAACAAA AACTGGACAG 

22 51 GAAGAATCTG AAGC AAAAAC TGGACAGGAA GAATCTAAAG CAAACATTGA 

23 01 AAGTAAACGA CCCAAAGCAA AGAGTGTCAA GAAACAAAAA AAGTAA 
(SEQ ID NO: 11) 

The amino acid sequences of the polypeptides encoded by the nucleotide sequence of the 
invention includes: 



Name: SVPH-1 (polypeptide) 



1 MTAXEQCRRI FGAGANTASE TCYKELNTLG DRVGHCGIKN (SEQ ID NO:4) 



Name: SVPH-3 (polypeptide) 



1 EDWVYYRISH EEKDLFFNLT VNEGFLSNSY IMEKRYGNLS HVKMMAS SAP 
51 LCHLSGTVLiQ QGTRVGTAAL SACHGLTGFF QLPHGDFFIE PVKKHPLVEG 
101 GYHPHIVYRR QKVPETKEPT CGL (SEQ ID NO:5) 



Name: SVPH-4 (polypeptide) 



1 HEDLYLQRKY NDALAWSFGK VCSLEYAGSV STLLDTNILA PATWSAHELG 

51 HAVGMSHDEQ YCQCRGRPNC IMGSGRTGFS NCSYISFFKH ISSGATCLNN 

101 I PGLGYVLKR CGNKIVEDNE ECDCGSTEEC QKDRCCQSNC KLQPGANCSI 

151 GLCCHDCRFR PSGYVCRQEG NECDLAEYCD GNSSSCPNDV YKQDGT PC K Y 

2 01 EGRCFRKGCR SRYMQCQSIF GPDAMEAPSE CYDAVNLIGD QFGNCEITGI 
251 RNFKKCESAN SICGRLQCIN VETI PDLPEH TTIISTHLQA ENLMCWGTGY 
301 HLSMKPMGIP DLGMINDGTS CGEGRVCFKK NCVNSSVLQF DCLPEKCNTR 

3 51 GVCNNRKNCH CMYGWAPPFC EEVGYGGSID SGPPGLLRGA IPSSIWWSI 
401 IMFRLILLIL SWFVFFRQV IGNHLKPKQE KMPLSKAKTE QEESKTKTVQ 
451 EESKTKTGQE ESEAKTGQEE SKAKTGQEES KANIESKRPK AKS VKKQKK * 

(SEQ ID NO:6) 
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5 Name: SVPH-la (polypeptide) 

1 MKMLLLLHCL GVFLSCSGHI QDEHPQYHSP PDWIPVRIT GTTRGMTPPG 

51 WLSYILPFGG QKHIIHIKVK KLLFSKHLPV FTYTDQGAIL EDQPFVQNNC 

101 YYHGYVEGDP ESLVSLSTCF GGFQGILQIN DFAYEIKPLA FSTTFEHLVY 

10 151 KMDSEEKQFS TMRSGFMQNE ITCRMEFEEI DNSTQKQSSY VGWWIHFRIV 

201 EIVWIDNYL YIRYERNDSK LLEDLYVIVN IVDSILDVIG VKVLLFGLEI 

251 WTNKNLIWD DVRKSVHLYC KWKSENITPR MQHDTSHLFT TLGLRGLSGI 

301 GAFRGMCTPH RSCAIVTFMN KTLGTFSIAV AHHLGHNLGM NHDEDTCRCS 

3 51 QPRC IMHEGN PPITKFSNCS YGDFWEYTVE RTKCLLETVH TKDI FNVKRC 

15 401 GNGWEEGEE CDCGPLKHCA KDPCCLSNCT LTDGSTCAFG LCCKDCKFLP 

451 SGKVC RKEVN ECDLPEWCNG TSHKC PDDFY VEDGI PCKER GYCYEKSCHD 

501 RNEQCRRIFG AGANTAS ETC YKELNTLGDR VGHCG I KNAT YIKCNISDVQ 

551 CGRIQCENVT EIPNMSDHTT VHWARFND I M CWSTDYHLGM KGPDIGEVKD 

601 GTECGIDHIC IHRHCVHITI LNSNCSPAFC NKRG I CNNKH HCHCNYLWDP 

20 651 PNCLIKGYGG SVDSGPPPKR KKKKKFCYLC ILLLIVLFIL LCCLYRLCKK 

7 01 SKPIKKQQDV QTPSAKEEEK IQRRPHELPP QSQPWVMPSQ SQPPVTPSQR 

751 QPQLMPSQSQ PPVTPS* (SEQIDNO:12) 



25 



Name: SVPH-lb (polypeptide) 



1 MKMLLLLHCL GVFLSCSGHI QDEHPQYHSP PDWIPVRIT GTTRGMTPPG 

51 WLSYILPFGG QKHIIHIKVK KLLFSKHLPV FTYTDQGAIL EDQPFVQNNC 

101 YYHGYVEGDP ESLVSLSTCF GGFQGILQIN DFAYEIKPLA FSTTFEHLVY 

151 KMDSEEKQFS TMRSGFMQNE ITCRMEFEEI DNSTQKQSSY VGWWIHFRIV 

30 201 EIVWIDNYL YIRYERNDSK LLEDLYVIVN IVDSILDVIG VKVLLFGLEI 

251 WTNKNLIWD DVRKSVHLYC KWKSENITPR MQHDTSHLFT TLGLRGLSGI 

3 01 GAFRGMCTPH RSCAIVTFMN KTLGTFSIAV AHHLGHNLGM NHDEDTCRCS 

3 51 QPRC IMHEGN PPITKFSNCS YGDFWEYTVE RTKCLLETVH TKDI FNVKRC 

401 GNGWEEGEE CDCGPLKHCA KDPCCLSNCT LTDGSTCAFG LCCKDCKFLP 

35 451 SGKVCRKEVN ECDLPEWCNG TSHKC PDDFY VEDGI PCKER GYCYEKSCHD 

501 RNEQCRRIFG AGANTASETC YKELNTLGDR VGHCGIKNAT YIKCNISDVQ 

551 CGRIQCENVT EIPNMSDHTT VHWARFND I M CWSTDYHLGM KGPDIGEVKD 

601 GTECGIDHIC IHRHCVHITI LNSNCSPAFC NKRG I CNNKH HCHCNYLWDP 

651 PNCLIKGYGG SVDSGPPPKR KKKKKFCYLC ILLLIVLFIL LCCLYRLCKK 

40 701 SKPIKKQQDV QTPSAKEEEK IQRRPHELPP QSQPWVMPSQ SQPPVTPSQS 

751 HPQVMPSQSQ PPQNLFLFSF SISDCVLNFR LLYLQAT* (SEQ ID NO: 13) 

Name: SVPH-lc (polypeptide) 

45 1 MKMLLLLHCL GVFLSCSGHI QDEHPQYHSP PDWIPVRIT GTTRGMTPPG 

51 WLSYILPFGG QKHIIHIKVK KLLFSKHLPV FTYTDQGAIL EDQPFVQNNC 

101 YYHGYVEGDP ESLVSLSTCF GGFQGILQIN DFAYEIKPLA FSTTFEHLVY 

151 KMDSEEKQFS TMRSGFMQNE ITCRMEFEEI DNSTQKQSSY VGWWIHFRIV 

201 EIVWIDNYL YIRYERNDSK LLEDLYVIVN IVDSILDVIG VKVLLFGLEI 

50 2 51 WTNKNLIWD DVRKSVHLYC KWKSENITPR MQHDTSHLFT TLGLRGLSGI 

3 01 GAFRGMCTPH RSCAIVTFMN KTLGTFSIAV AHHLGHNLGM NHDEDTCRCS 

3 51 QPRC IMHEGN PPITKFSNCS YGDFWEYTVE RTKCLLETVH TKDI FNVKRC 

401 GNGWEEGEE CDCGPLKHCA KDPCCLSNCT LTDGSTCAFG LCCKDCKFLP 

451 SGKVCRKEVN ECDLPEWCNG TSHKC PDDFY VEDGI PCKER GYCYEKSCHD 

55 501 RNEQCRRIFG AGANTASETC YKELNTLGDR VGHCGIKNAT YIKCNISDVQ 

551 CGRIQCENVT EIPNMSDHTT VHWARFND I M CWSTDYHLGM KGPDIGEVKD 

601 GTECGIDHIC IHRHCVHITI LNSNCSPAFC NKRG I CNNKH HCHCNYLWDP 

651 PNCLIKGYGG SVDSGPPPKR KKKKKFCYLC ILLLIVLFIL LCCLYRLCKK 

701 SKPIKKQQDV QTPSAKEEEK IQRRPHELPP QSQPWVMPSQ SQPPVTPSQS 

60 751 HPRVMPSQSQ PPVMPSQSHP QLTPSQSQPP VMPSQSHPQL TPSQSQPPVT 

801 PSQRQPQLMP SQSQPPVTPS * (SEQIDNO:14) 

Name: SVPH-4a (polypeptide) 

65 1 MRSVQIFLSQ CRLLLLLVPT MLLKSLGEDV IFHPEGEFDS YEVTIPEKLS 

51 FRGEVQGWS PVSYLLQLKG KKHVLHLWPK RLLLPRHLRV FSFTEHGELL 

101 EDHPYI PKDC NYMGSVKESL DSKATI STCM GGLRGVFNID AKHYQIEPLK 

151 ASPSFEHWY LLKKEQFGNQ VCGLSDDEIE WQMAPYENKA RLRDFPGSYK 
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201 HPKYLELILL FDQSRYRFVN NNLSQVIHDA ILLTGIMDTY FQDVRMRIHL 

251 KALEVWTDFN KIRVGYPELA EVLGRFVIYK KSVLNARLSS DWAHLYLQRK 

3 01 YNDALAWSFG KVCSLEYAGS VSTLLDTNIL APATWSAHEL GHAVGM SHDE 

351 QYCQCRGRPN CIMGSGRTGF SNCSYISFFK HISSGATCLN NIPGLGYVLK 

401 RCGNKIVEDN EECDCGSTEE CQKDRCCQSN CKLQPGANCS IGLCCHDCRF 

451 RPSGYVCRQE GNECDLAEYC DGNSSSCPND VYKQDGTPCK YEGRCFRKGC 

501 RSRYMQCQSI FGPDAMEAPS ECYDAVNLIG DQFGNCEITG IRNFKKCESA 

551 NSICGRLQCI NVETIPDLPE HTTIISTHLQ AENLMCWGTG YHLSMKPMGI 

601 PDLGMINDGT SCGEGRVCFK KNCVNSSVLQ FDCLPEKCNT RGVCNNRKNC 

651 HCMYGWAPPF CEEVGYGGSI DSGPPGLLRG AIPSSIWWS IIMFRLILLI 

701 LSWFVFFRQ VIGNHLKPKQ EKMPLSKAKT EQEESKTKTV QEESKTKTGQ 

751 EESEAKTGQE ESKAKTGQEE SKANIESKRP KAKSVKKQKK * 
(SEQIDN0:15) 



Name: SVPH-4b (polypeptide) 

1 MRSVQIFLSQ CRLLLLLVPT MLLKSLGEDV IFHPEGEFDS YEVTIPEKLS 

51 FRGEVQGWS PVSYLLQLKG KKHVLHLWPK RLLLPRHLRV FSFTEHGELL 

101 EDHPYIPKDC NYMGSVKESL DSKATISTCM GGLRGVFNID AKHYQIEPLK 

151 ASPSFEHWY LLKKEQFGNQ VCGLSDDEIE WQMAPYENKA RLRDFPGSYK 

201 HPKYLELILL FDQSRYRFVN NNLSQVIHDA ILLTGIMDTY FQDVRMRIHL 

251 KALEVWTDFN KIRVGYPELA EVLGRFVIYK KSVLNARLSS DWAHLYLQRK 

3 01 YNDALAWSFG KVCSLEYAGS VSTLLDTNIL APATWPAHEL GHAVGMSHDE 

351 QYCQCRGRLN CIMGSGRTGF SNCSYISFFK HISSGATCLN NIPGLGYVLK 

401 RCGNKIVEDN EECDCGSTEE CQKDRCCQSN CKLQPGANCS IGLCCHDCRF 

451 RPSGYVCRQE GNECDLAEYC DGNSSSCPND VYKQDGTPCK YEGRCFRKGC 

501 RSRYMQCQSI FGPDAMEAPS EC YDAVNL I G DQFGNCEITG IRNFKKCESA 

551 NSICGRLQCI NVETIPDLPE HTTIISTHLQ AENLMCWGTG YHLSMKPMGI 

601 PDLGMINDGT SCGEGRVCFK KNCVNSSVLQ FDCLPEKCNT RGVCNNRKNC 

651 HCMYGWAPPF CEEVGYGGSI DSGPPGLLRG AIPSSIWWS IIMFRLILLI 

7 01 LSWFVFFRQ VIGNHLKPKQ EKMPLSKAKT EQEESKTKTV QEESKTKTGQ 

751 EESEAKTGQE ESKANIESKR PKAKSVKKQK K* (SEQIDNO:16) 

The discovery of the nucleic acids of the invention enables the construction of expression 
vectors comprising nucleic acid sequences encoding polypeptides; host cells transfected or transformed 
with the expression vectors; isolated and purified biologically active polypeptides and fragments 
thereof; the use of the nucleic acids or oligonucleotides thereof as probes to identify nucleic acid 
encoding proteins having metalloproteinase-disintegrin activity; the use of the nucleic acids or 
oligonucleotides thereof to identify human chromosome number 1 or 4; the use of the nucleic acids or 
oligonucleotides thereof to map genes on human chromosome number 1 or 4; the use of the nucleic 
acid or oligonucleotides thereof to identify genes associated with certain diseases, syndromes or other 
human conditions associated with human chromosome number 1 or 4, including fetal hydantoin 
syndrome, diphenylhydantoin toxicity, and pheochromocytoma; the use of single-stranded sense or 
antisense oligonucleotides from the nucleic acids to inhibit expression of polynucleotide encoded by 
the SVPH-1, SVPH-3, or SVPH-4 gene; the use of such polypeptides and soluble fragments to function 
as a proteinase; the use of such polypeptides and fragmented peptides as molecular weight markers; the 
use of such polypeptides and fragmented peptides as controls for peptide fragmentation, and kits 
comprising these reagents; the use of such polypeptides and fragments thereof to generate antibodies; 
and the use of antibodies to purify SVPH polypeptides. 
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5 NUCLEIC ACID MOLECULES 

In a particular embodiment, the invention relates to certain isolated nucleotide sequences that 
are free from contaminating endogenous material. A "nucleotide sequence" refers to a polynucleotide 
molecule in the form of a separate fragment or as a component of a larger nucleic acid construct. The 
nucleic acid molecule has been derived from DNA or RNA isolated at least once in substantially pure 
10 form and in a quantity or concentration enabling identification, manipulation, and recovery of its 
component nucleotide sequences by standard biochemical methods, such as those outlined in Sambrook 
et al., Molecular Cloning: A Laboratory Manual, 2nd ed., Cold Spring Harbor Laboratory, Cold Spring 
Harbor, NY (1989). Such sequences are preferably provided and/or constructed in the form of an open 
reading frame uninterrupted by internal non-translated sequences, or introns, that are typically present 
15 in eukaryotic genes. Sequences of non-translated DNA can be present 5' or 3' from an open reading 
frame, where the same do not interfere with manipulation or expression of the coding region. 

Nucleic acid molecules of the invention include DNA in both single-stranded and double- 
stranded form, as well as the RNA complement thereof. DNA includes, for example, cDNA, genomic 
DNA, chemically synthesized DNA, DNA amplified by PCR, and combinations thereof. Genomic 
20 DNA may be isolated by conventional techniques, e.g., using the cDNA of SEQ ID NOs:l-3 and 7-11, 
or a suitable fragment thereof, as a probe. 

The DNA molecules of the invention include full length genes as well as polynucleotides and 
fragments thereof. The full length gene may include the N-terminal signal peptide. Other 
embodiments include DNA encoding a soluble form, e.g., encoding the extracellular domain of the 
25 protein, either with or without the signal peptide. 

The nucleic acids of the invention are preferentially derived from human sources, but the 
invention includes those derived from no n- human species, as well. 

Preferred Sequences 

30 Particularly preferred nucleotide sequences of the invention are SEQ ID NOs: 1-3 and 7-1 1, as 

set forth above. The sequences of amino acids encoded by the DNA of SEQ ID NOs: 1-3 and 7-1 1 are 
shown in SEQ ID NOs:4-6 and 12-16, respectively. In SEQ ID NO:l "N" can represent any 
nucleotide. These sequences identify the SVPH polynucleotides as members of the metal loproteinase- 
disintegrin family. As noted above, proteins of this family are characterized by a pro-domain, a 

35 disintegrin domain, a metalloproteinase domain, a cysteine rich region, a transmembrane domain, and a 
cytoplasmic tail. 

In particular, SVPH- 1 (originally isolated from human testis) and S VPH-4 (originally isolated 
from human testis, fetal lung, and B-cells) both share homology to the cysteine rich region of the 
metalloproteinase-disintegrin family, and SVPH-3 (originally isolated from human fetus tissue) shares 
40 homology to the pro-domain of these family members. In addition, SVPH-4 polypeptide (SEQ ID 
NO:3) encodes a zinc binding motif (His 47 to Asp 58), a disintegrin domain (Leu 104 to Cys 179), and 
a cysteine rich region (Asp 180 to Arg 388). 
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SVPH-la, SVPH-lb, and SVPH-lc represent the nucleotide sequences (SEQ ID NOs:7-9) of 
three alternatively spliced SVPH-1 clones with divergent cytoplasmic domains. These clones were 
isolated by screening a human testis library (Clonetech cat no. HL3024a) at 42°C and washing at 42°C 

> 

in 2x SSC using four different oligonucleotides: 
CACCTAAGGTGTTCAATTCTTTG (SEQ ID NO: 17), 

CAAATACTGCAAGTGAGACTTGC (SEQ ID NO: 18), TGCACAACTACGTGTGGTGTACCC 
(SEQ ID NO: 19), and GAGCCACTGCAATTGAAAAAGTGCCC (SEQ ID NO:20). 

SVPH-4a and SVPH-4b represent the nucleotide sequences (SEQ ID NOs:10-ll) of two 
alternatively spliced SVPH-4 clones with divergent cytoplasmic domains. These clones were isolated 
by screening a human testis library (Clonetech cat no. HL3024a) at 42°C and washing at 42°C in 2x 
SSC using three different oligonucleotides: 
AATGATGCTCTTGCATGGTCG (SEQ ID NO:21), 
CTTTCACGGAGCCCATGTAGTTGCAG (SEQ ID NO:22), and 
TGAAGGAGAAAACGCGCAGATGTCGG (SEQ ID NO:23). 

Additional Sequences 

Due to the known degeneracy of the genetic code, wherein more than one codon can encode 
the same amino acid, a DNA sequence can vary from that shown in SEQ ID NOs:l-3 and 7-11, and 
still encode a polypeptide having the amino acid sequence of SEQ ID NOs:4-6 and 12-16, respectively. 
Such variant DNA sequences 

can result from silent mutations (e.g., occurring during PCR amplification), or can be the product of 
deliberate mutagenesis of a native sequence. 

The invention thus provides isolated DNA sequences encoding polypeptides of the invention, 
selected from: (a) DNA comprising the nucleotide sequence of SEQ ID NOs:l-3 and 7-11; (b) DNA 
encoding the polypeptides of SEQ ID NOs:4-6 and 12-16; (c) DNA capable of hybridization to a DNA 
of (a) or (b) under conditions of moderate stringency and which encodes polypeptides of the invention; 
(d) DNA capable of hybridization to a DNA of (a) or (b) under conditions of high stringency and which 
encodes polypeptides of the invention, and (e) DNA which is degenerate as a result of the genetic code 
to a DNA defined in (a), (b), (c), or (d) and which encode polypeptides of the invention. Of course, 
polypeptides encoded by such DNA sequences are encompassed by the invention. 

As used herein, conditions of moderate stringency can be readily determined by those having 
ordinary skill in the art based on, for example, the length of the DNA. The basic conditions are set 
forth by Sambrook et al. Molecular Cloning: A Laboratory Manual, 2nd ed. Vol. 1, pp. 1.101-104, 
Cold Spring Harbor Laboratory Press, (1989), and include use of a prewashing solution for the 
nitrocellulose filters 5X SSC, 0.5% SDS, 1.0 mM EDTA (pH 8.0), hybridization conditions of about 
50% formamide, 6X SSC at about 42°C (or other similar hybridization solution, such as Stark's 
solution, in about 50% formamide at about 42°C), and washing conditions of about 60°C, 0.5X SSC, 
0.1% SDS. Conditions of high stringency can also be readily determined by the skilled artisan based 
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5 on, for example, the length of the DNA. Generally, such conditions are defined as hybridization 
conditions as above, and with washing at approximately 68°C, 0.2X SSC, 0.1% SDS. The skilled 
artisan will recognize that the temperature and wash solution salt concentration can be adjusted as 
necessary according to factors such as the length of the probe. 

Also included as an embodiment of the invention is DNA encoding polypeptide fragments and 

10 polypeptides comprising inactivated N-glycosylation site(s), inactivated protease processing site(s), or 
conservative amino acid substitution(s), as described below. 

In another embodiment, the nucleic acid molecules of the invention also comprise nucleotide 
sequences that are at least 80% identical to a native sequence. Also contemplated are embodiments in 
which a nucleic acid molecule comprises a sequence that is at least 90% identical, at least 95% 

15 identical, at least 98% identical, at least 99% identical, or at least 99.9% identical to a native sequence. 

The percent identity may be determined by visual inspection and mathematical calculation. 
Alternatively, the percent identity of two nucleic acid sequences can be determined by comparing 
sequence information using the GAP computer program, version 6.0 described by Devereux et al., 
Nucl. Acids Res., 12:387 (1984) and available from the University of Wisconsin Genetics Computer 

20 Group (UWGCG). The preferred default parameters for the GAP program include: (1) a unary 
comparison matrix (containing a value of 1 for identities and 0 for non- identities) for nucleotides, and 
the weighted comparison matrix of Gribskov and Burgess, Nucl. Acids Res. 74:6745 (1986), as 
described by Schwartz and Dayhoff, eds., Adas of Protein Sequence and Structure, National 
Biomedical Research Foundation, pp. 353-358 (1979); (2) a penalty of 3.0 for each gap and an 

25 additional 0.10 penalty for each symbol in each gap; and (3) no penalty for end gaps. Other programs 
used by one skilled in the art of sequence comparison may also be used. 

The invention also provides isolated nucleic acids useful in the production of polypeptides. 
Such polypeptides may be prepared by any of a number of conventional techniques. A DNA sequence 
encoding an SVPH polypeptide, or desired fragment thereof may be subcloned into an expression 

30 vector for production of the polypeptide or fragment. The DNA sequence advantageously is fused to a 
sequence encoding a suitable leader or signal peptide. Alternatively, the desired fragment may be 
chemically synthesized using known techniques. DNA fragments also may be produced by restriction 
endonuclease digestion of a full length cloned DNA sequence, and isolated by electrophoresis on 
agarose gels. If necessary, oligonucleotides that reconstruct the 5' or 3' terminus to a desired point may 

35 be Iigated to a DNA fragment generated by restriction enzyme digestion. Such oligonucleotides may 
additionally contain a restriction endonuclease cleavage site upstream of the desired coding sequence, 
and position an initiation codon (ATG) at the N-terminus of the coding sequence. 

The well-known polymerase chain reaction (PCR) procedure also may be employed to isolate 
and amplify a DNA sequence encoding a desired protein fragment. Oligonucleotides that define the 

40 desired termini of the DNA fragment are employed as 5' and 3' primers. The oligonucleotides may 
additionally contain recognition sites for restriction endonucleases, to facilitate insertion of the 
amplified DNA fragment into an expression vector. PCR techniques are described in Saiki et al., 
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5 Science 239:487 (1988); Recombinant DNA Methodology, Wu et al., eds., Academic Press, Inc., San 
Diego, pp. 189-196 (1989); and PCR Protocols: A Guide to Methods and Applications, Innis et al., 
eds., Academic Press, Inc. (1990). 

POLYPEPTIDES AND FRAGMENTS THEREOF 
10 The invention encompasses polypeptides and fragments thereof in various forms, including 

those that are naturally occurring or produced through various techniques such as procedures involving 

recombinant DNA technology. Such forms include, but are not limited to, derivatives, variants, and 

oligomers, as well as fusion proteins or fragments thereof. 
Polypeptides and Fragments Thereof 
15 The polypeptides of the invention include the proteins encoded by the nucleic acid sequences 

set forth above. Particularly preferred polypeptides comprise the amino acid sequence of SEQ ID 

NOs:4-6 and 12-16. 

The polypeptides, as set forth in SEQ ID NOs:4 and 6, include a cysteine rich region 
homologous to the metalloproteinase-disintegrin family, and the polypeptide of SEQ ID NO:5 includes 

20 a pro-domain homologous to the same family of proteins. SVPH-1 (SEQ ID NO:4) has an N-terminal 
region having amino acids Met 1 to Asn 40. In SEQ ID NO:4 "X" can represent any amino acid. 
SVPH-3 (SEQ ID NO:5) has an N-terminal region having amino acids Asn 1 to Leu 23. SVPH-4 (SEQ 
ID NO:6) also includes an extracellular domain comprising amino acids His 1 to Arg 388, a 
transmembrane region comprising amino acids Gly 389 through Phe 417, and a C-terminal cytoplasmic 

25 domain comprising amino acids Arg 418 to Lys 499 and is believed to overlap with EST designated 
AA 782936. 

The SVPH-1 a polypeptide (SEQ ID NO: 12), SVPH-lb polypeptide (SEQ ID NO: 13), and 
SVPH-lc polypeptide (SEQ ID NO: 14) each encodes a signal sequence (Met 1 to Ser 15), a pro- 
domain (Cys 16 to Ser 188), a catalytic domain (Ser 189 to Thr 388), a disintegrin domain (Val 389 to 

30 Gly 491), a cysteine rich region (Tyr 492 to Lys 675), and a transmembrane domain (Phe 676 to Cys 
698). In addition, each of the SVPH-la, SVPH-lb, and SVPH-lc polypeptides (SEQ ID NOs:12-14) 
encodes a cytoplasmic domain. Due to alternative splicing the cytoplasmic domain of each polypeptide 
is different. For SVPH-la, SVPH-lb, and SVPH-lc the cytoplasmic domains are (Lys 699 to Ser 766), 
(Lys 699 to Thr 787), and (Lys 699 to Ser 820), respectively. 

35 Similarly, the SVPH-4a polypeptide (SEQ ID NO: 15) and SVPH-4b polypeptide (SEQ ID 

NO: 16) each encodes a signal sequence (Met 1 to Gly 27), a pro-domain (Glu 28 to Arg 193), a 
catalytic domain (Asp 194 to He 392), a disintegrin domain (Pro 393 to Gly 493), a cysteine rich region 
(Arg 494 to Ser 685), and a transmembrane domain (He 686 to Gly 713). In addition, each of the 
SVPH-4a and SVPH-4b polypeptides (SEQ ID NOs: 15-16) encodes a cytoplasmic domain. Due to 

40 alternative splicing the cytoplasmic domain of each polypeptide is different. The cytoplasmic domain 
of SVPH-4a is (Asn 714 to Lys 790), and the cytoplasmic domain of SVPH-4b is (Asn 714 to Lys 
781). 
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5 The skilled artisan will recognize that the above-described boundaries of such regions of the 

polypeptide are approximate and that the boundaries of the transmembrane region (which may be 
predicted by using computer programs available for that purpose) may differ from those described 
above. 

The polypeptides of the invention may be membrane bound or they may be secreted and thus 
10 soluble. Soluble polypeptides are capable of being secreted from the cells in which they are expressed. 
In general, soluble polypeptides may be identified (and distinguished from non-soluble membrane- 
bound counterparts) by separating intact cells which express the desired polypeptide from the culture 
medium, e.g., by centrifugation, and assaying the medium (supernatant) for the presence of the desired 
polypeptide. The presence of polypeptide in the medium indicates that the polypeptide was secreted 
1 5 from the cells and thus is a soluble form of the protein. 

In one embodiment, the soluble polypeptides and fragments thereof comprise all or part of the 
extracellular domain, but lack the transmembrane region that would cause retention of the polypeptide 
on a cell membrane. A soluble polypeptide may include the cytoplasmic domain, or a portion thereof, 
as long as the polypeptide is secreted from the cell in which it is produced. 
20 In general, the use of soluble forms is advantageous for certain applications. Purification of 

the polypeptides from recombinant host cells is facilitated, since the soluble polypeptides are secreted 
from the cells. Further, soluble polypeptides are generally more suitable for intravenous 
administration. 

The invention also provides polypeptides and fragments of the extracellular domain that retain 
25 a desired biological activity. Particular embodiments are directed to polypeptide fragments that retain 
the ability to bind the "binding partner" or the native cognates, substrates, or counter-structure. Such a 
fragment may be a soluble polypeptide, as described above. In another embodiment, the polypeptides 
and fragments advantageously include regions that are conserved in the SVPH family as described 
above. 

30 Also provided herein are polypeptide fragments comprising at least 20, or at least 30, 

contiguous amino acids of the sequences of SEQ ID NOs:4-6 and 12-16. Fragments derived from the 
cytoplasmic domain find use in studies of signal transduction, and in regulating cellular processes 
associated with transduction of biological signals. Polypeptide fragments also may be employed as 
immunogens, in generating antibodies. 

35 

Variants 

Naturally occurring variants as well as derived variants of the polypeptides and fragments are 
provided herein. 

Variants may exhibit amino acid sequences that are at least 80% identical. Also contemplated 
40 are embodiments in which a polypeptide or fragment comprises an amino acid sequence that is at least 
90% identical, at least 95% identical, at least 98% identical, at least 99% identical, or at least 99.9% 
identical to the preferred polypeptide or fragment thereof. Percent identity may be determined by 



18 



Metalloproteinase-Disintegrin Polypeptides 25 17 -USB 



5 visual inspection and mathematical calculation. Alternatively, the percent identity of two protein 
sequences can be determined by comparing sequence information using the GAP computer program, 
based on the algorithm of Needleman and Wunsch, J. Mot. BioL, 48:443 (1970) and available from the 
University of Wisconsin Genetics Computer Group (UWGCG). The preferred default parameters for 
the GAP program include: (1) a scoring matrix, blosum62, as described by Henikoff and Henikoff, 
10 Proc. Natl Acad. Set USA, 89:10915 (1992); (2) a gap weight of 12; (3) a gap length weight of 4; and 
(4) no penalty for end gaps. Other programs used by one skilled in the art of sequence comparison may 
also be used. 

The variants of the invention include, for example, those that result from alternate mRNA 
splicing events or from proteolytic cleavage. Alternate splicing of mRNA may, for example, yield a 

15 truncated but biologically active protein, such as a naturally occurring soluble form of the protein. 
Variations attributable to proteolysis include, for example, differences in the N- or C-termini upon 
expression in different types of host cells, due to proteolytic removal of one or more terminal amino 
acids from the protein (generally from 1-5 terminal amino acids). Proteins in which differences in 
amino acid sequence are attributable to genetic polymorphism (allelic variation among individuals 

20 producing the protein) are also contemplated herein. 

Additional variants within the scope of the invention include polypeptides that may be 
modified to create derivatives thereof by forming covalent or aggregative conjugates with other 
chemical moieties, such as glycosyl groups, lipids, phosphate, acetyl groups and the like. Covalent 
derivatives may be prepared by linking the chemical moieties to functional groups on amino acid side 

25 chains or at the N-terminus or C-terminus of a polypeptide. Conjugates comprising diagnostic 
(detectable) or therapeutic agents attached thereto are contemplated herein, as discussed in more detail 
below. 

Other derivatives include covalent or aggregative conjugates of the polypeptides with other 
proteins or polypeptides, such as by synthesis in recombinant culture as N- terminal or C-terminal 

30 fusions. Examples of fusion proteins are discussed below in connection with oligomers. Further, 
fusion proteins can comprise peptides added to facilitate purification and identification. Such peptides 
include, for example, poly-His or the antigenic identification peptides described in U.S. Patent No. 
5,01 1,912 and in Hopp et al., Bio/Technology 6:1204 (1988). One such peptide is the FLAG® peptide, 
Asp-Tyr-Lys-Asp-Asp-Asp-Asp-Lys (SEQ ID NO:28), which is highly antigenic and provides an 

35 epitope reversibly bound by a specific monoclonal antibody, enabling rapid assay and facile 
purification of expressed recombinant protein. A murine hybridoma designated 4E11 produces a 
monoclonal antibody that binds the FLAG® peptide in the presence of certain divalent metal cations, as 
described in U.S. Patent 5,011,912, hereby incorporated by reference. The 4E11 hybridoma cell line 
has been deposited with the American Type Culture Collection under accession no. HB 9259. 

40 Monoclonal antibodies that bind the FLAG® peptide are available from Eastman Kodak Co., Scientific 
Imaging Systems Division, New Haven, Connecticut. 
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5 Among the variant polypeptides provided herein are variants of native polypeptides that retain 

the native biological activity or the substantial equivalent thereof. One example is a variant that binds 
with essentially the same binding affinity as does the native form. Binding affinity can be measured by 
conventional procedures, e.g., as described in U.S. Patent No. 5,512,457 and as set forth below. 

Variants include polypeptides that are substantially homologous to the native form, but which 
10 have an amino acid sequence different from that of the native form because of one or more deletions, 
insertions or substitutions. Particular embodiments include, but are not limited to, polypeptides that 
comprise from one to ten deletions, insertions or substitutions of amino acid residues, when compared 
to a native sequence. 

A given amino acid may be replaced, for example, by a residue having similar physiochemical 
15 characteristics. Examples of such conservative substitutions include substitution of one aliphatic 
residue for another, such as lie, Val, Leu, or Ala for one another; substitutions of one polar residue for 
another, such as between Lys and Arg, Glu and Asp, or Gin and Asn; or substitutions of one aromatic 
residue for another, such as Phe, Trp, or Tyr for one another. Other conservative substitutions, e.g., 
involving substitutions of entire regions having similar hydrophobicity characteristics, are well known. 
20 Similarly, the DNAs of the invention include variants that differ from a native DNA sequence 

because of one or more deletions, insertions or substitutions, but that encode a biologically active 
polypeptide. 

The invention further includes polypeptides of the invention with or without associated native- 
pattern glycosylation. Polypeptides expressed in yeast or mammalian expression systems (e.g., COS-1 

25 or COS-7 cells) can be similar to or significantly different from a native polypeptide in molecular 
weight and glycosylation pattern, depending upon the choice of expression system. Expression of 
polypeptides of the invention in bacterial expression systems, such as E. coli, provides non- 
glycosylated molecules. Further, a given preparation may include multiple differentially glycosylated 
species of the protein. Glycosyl groups can be removed through conventional methods, in particular 

30 those utilizing glycopeptidase. In general, glycosylated polypeptides of the invention can be incubated 
with a molar excess of glycopeptidase (Boehringer Mannheim). 

Correspondingly, similar DNA constructs that encode various additions or substitutions of 
amino acid residues or sequences, or deletions of terminal or internal residues or sequences are 
encompassed by the invention. For example, N-glycosylation sites in the polypeptide extracellular 

35 domain can be modified to preclude glycosylation, allowing expression of a reduced carbohydrate 
analog in mammalian and yeast expression systems. N-glycosylation sites in eukaryotic polypeptides 
are characterized by an amino acid triplet Asn-X-Y, wherein X is any amino acid except Pro and Y is 
Ser or Thr. Appropriate substitutions, additions, or deletions to the nucleotide sequence encoding these 
triplets will result in prevention of attachment of carbohydrate residues at the Asn side chain. 

40 Alteration of a single nucleotide, chosen so that Asn is replaced by a different amino acid, for example, 
is sufficient to inactivate an N-glycosylation site. Alternatively, the Ser or Thr can by replaced with 
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5 another amino acid, such as Ala. Known procedures for inactivating N-glycosylation sites in proteins 
include those described in U.S. Patent 5,071,972 and EP 276,846, hereby incorporated by reference. 

In another example of variants, sequences encoding Cys residues that are not essential for 
biological activity can be altered to cause the Cys residues to be deleted or replaced with other amino 
acids, preventing formation of incorrect intramolecular disulfide bridges upon folding or renaturation. 

10 Other variants are prepared by modification of adjacent dibasic amino acid residues, to 

enhance expression in yeast systems in which KEX2 protease activity is present. EP 212,914 discloses 
the use of site-specific mutagenesis to inactivate KEX2 protease processing sites in a protein. KEX2 
protease processing sites are inactivated by deleting, adding or substituting residues to alter Arg-Arg, 
Arg-Lys, and Lys-Arg pairs to eliminate the occurrence of these adjacent basic residues. Lys-Lys 

15 pairings are considerably less susceptible to KEX2 cleavage, and conversion of Arg-Lys or Lys-Arg to 
Lys-Lys represents a conservative and preferred approach to inactivating KEX2 sites. 

Oligomers 

Encompassed by the invention are oligomers or fusion proteins that contain SVPH 
20 polypeptides. When the polypeptide of the invention is a type I membrane protein, such as SVPH, the 
fusion partner is linked to the C terminus of the type I membrane protein. Such oligomers may be in the 
form of covalently-Hnked or non-covalently-linked multimers, including dimers, trimers, or higher 
oligomers. As noted above, preferred polypeptides are soluble and thus these oligomers may comprise 
soluble polypeptides. In one aspect of the invention, the oligomers maintain the binding ability of the 
25 polypeptide components and provide therefor, bivalent, trivalent, etc., binding sites. 

One embodiment of the invention is directed to oligomers comprising multiple polypeptides 
joined via covalent or non-covalent interactions between peptide moieties fused to the polypeptides. 
Such peptides may be peptide linkers (spacers), or peptides that have the property of promoting 
oligomerization. Leucine zippers and certain polypeptides derived from antibodies are among the 
30 peptides that can promote oligomerization of the polypeptides attached thereto, as described in more 
detail below. 

Immunoglobulin-based Oligomers 
As one alternative, an oligomer is prepared using polypeptides derived from 
immunoglobulins. Preparation of fusion proteins comprising certain heterologous polypeptides fused 
35 to various portions of antibody-derived polypeptides (including the Fc domain) has been described, 
e.g., by Ashkenazi et al., PNAS USA, 88:10535 (1991); Byrn et al., Nature, 344:677 (1990); and 
Hollenbaugh and Aruffo, "Construction of Immunoglobulin Fusion Proteins", Current Protocols in 
Immunology, Suppl. 4, pages 10.19.1 - 10.19.11 (1992). 

One embodiment of the present invention is directed to a dimer comprising two fusion 
40 proteins created by fusing a polypeptide of the invention to an Fc polypeptide derived from an 
antibody. A gene fusion encoding the polypeptide/Fc fusion protein is inserted into an appropriate 
expression vector. Polypeptide/Fc fusion proteins are expressed in host cells transformed with the 
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5 recombinant expression vector, and allowed to assemble much like antibody molecules, whereupon 
interchain disulfide bonds form between the Fc moieties to yield divalent molecules. 

The term "Fc polypeptide" as used herein includes native and mutein forms of polypeptides 
made up of the Fc region of an antibody comprising any or all of the CH domains of the Fc region. 
Truncated forms of such polypeptides containing the hinge region that promotes dimerization are also 
10 included. Preferred polypeptides comprise an Fc polypeptide derived from a human IgGl antibody. 

One suitable Fc polypeptide, described in PCT application WO 93/10151 (hereby incorporated 
by reference), is a single chain polypeptide extending from the N- terminal hinge region to the native C- 
terminus of the Fc region of a human IgGl antibody. Another useful Fc polypeptide is the Fc mutein 
described in U.S. Patent 5,457,035 and in Baum et al., EMBO J. 13:3992-4001 (1994), incorporated 
15 herein by reference. The amino acid sequence of this mutein is identical to that of the native Fc 
sequence presented in WO 93/10151, except that amino acid 19 has been changed from Leu to Ala, 
amino acid 20 has been changed from Leu to Glu, and amino acid 22 has been changed from Gly to 
Ala. The mutein exhibits reduced affinity for Fc receptors. 

The above-described fusion proteins comprising Fc moieties (and oligomers formed 
20 therefrom) offer the advantage of facile purification by affinity chromatography over Protein A or 
Protein G columns. 

In other embodiments, the polypeptides of the invention may be substituted for the variable 
portion of an antibody heavy or light chain. If fusion proteins are made with both heavy and light 
chains of an antibody, it is possible to form an oligomer with as many as four SVPH extracellular 
25 regions. 

Pepti de-linker Based Oligomers 
Alternatively, the oligomer is a fusion protein comprising multiple polypeptides, with or 
without peptide linkers (spacer peptides). Among the suitable peptide linkers are those described in 
30 U.S. Patents 4,751,180 and 4,935,233, which are hereby incorporated by reference. A DNA sequence 
encoding a desired peptide linker may be inserted between, and in the same reading frame as, the DNA 
sequences of the invention, using any suitable conventional technique. For example, a chemically 
synthesized oligonucleotide encoding the linker may be ligated between the sequences. In particular 
embodiments, a fusion protein comprises from two to four soluble SVPH polypeptides, separated by 
35 peptide linkers. 

Leucine-Zippers 

Another method for preparing the oligomers of the invention involves use of a leucine zipper. 
Leucine zipper domains are peptides that promote oligomerization of the proteins in which they are 
40 found. Leucine zippers were originally identified in several DNA-binding proteins (Landschulz et al., 
Science, 240:1759 (1988)), and have since been found in a variety of different proteins. Among the 
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5 known leucine zippers are naturally occurring peptides and derivatives thereof that dimerize or 
trimerize. 

The zipper domain (also referred to herein as an oligomerizing, or oligomer-forming, domain) 
comprises a repetitive heptad repeat, often with four or five leucine residues interspersed with other 
amino acids. Examples of zipper domains are those found in the yeast transcription factor GCN4 and a 

10 heat-stable DNA-binding protein found in rat liver, C/EBP, (Landschulz et al., Science 243:1681 
(1989)). Two nuclear transforming proteins, fos and j«n, also exhibit zipper domains, as does the gene 
product of the murine proto-oncogene, c-myc (Landschulz et al., Science 240:1759 (1988)). The zipper 
domains of fos and jun preferentially form heterodimer (O'Shea et al., Science 245:646 (1989), Turner 
and Tjian, Science, 243:1689 (1989)). The zipper domain is necessary for biological activity (DNA 

15 binding) in these proteins. 

The fusogenic proteins of several different viruses, including paramyxovirus, coronavirus, 
measles virus and many retroviruses, also possess zipper domains (Buckland and Wild, Nature 338:547 
(1989); Britton, Nature, 353:394 (1991); Delwart and Mosialos, AIDS Research and Human 
Retroviruses 6:703 (1990)). The zipper domains in these fusogenic viral proteins are near the 

20 transmembrane region of the proteins; it has been suggested that the zipper domains could contribute to 
the oligomeric structure of the fusogenic proteins. Oligomerization of fusogenic viral proteins is 
involved in fusion pore formation (Spruce et al., Proc. Natl Acad. ScL U.S.A., 88:3523 (1991)). Zipper 
domains have also been recently reported to play a role in oligomerization of heat-shock transcription 
factors (Rabindran et al., Science 259:230 (1993)). 

25 Zipper domains fold as short, parallel coiled coils (O'Shea et al., Science 254:539 (1991)). 

The general architecture of the parallel coiled coil has been well characterized, with a "knobs-into- 
holes" packing as proposed by Crick, Acta Crystallogr., 6:689 (1953). The dimer formed by a zipper 
domain is stabilized by the heptad repeat, designated (abcdefg) Q according to the notation of 
McLachlan and Stewart, J. Mol. BioL, 98:293 (1975), in which residues a and d are generally 

30 hydrophobic residues, with d being a leucine, which line up on the same face of a helix. Oppositely- 
charged residues commonly occur at positions g and e. Thus, in a parallel coiled coil formed from two 
helical zipper domains, the "knobs" formed by the hydrophobic side chains of the first helix are packed 
into the "holes" formed between the side chains of the second helix. 

The residues at position d (often leucine) contribute large hydrophobic stabilization energies, 

35 and are important for oligomer formation (Krystek: et al., Int. J. Peptide Res. 38:229 (1991)). Lovejoy 
et al., Science 259:1288 (1993), recently reported the synthesis of a triple-stranded a-helical bundle in 
which the helices run up-up-down. Their studies confirmed that hydrophobic stabilization energy 
provides the main driving force for the formation of coiled coils from helical monomers. These studies 
also indicate that electrostatic interactions contribute to the stoichiometry and geometry of coiled coils. 

40 Further discussion of the structure of leucine zippers is found in Harbury et al., Science, 262:1401 
(1993). 
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5 Examples of leucine zipper domains suitable for producing soluble oligomeric proteins are 

described in PCT application WO 94/10308, as well as the leucine zipper derived from lung surfactant 
protein D (SPD) described in Hoppe et ah, FEBS Letters, 344:191 (1994), hereby incorporated by 
reference. The use of a modified leucine zipper that allows for stable trimerization of a heterologous 
protein fused thereto is described in Fanslow et al., Semin. Immunol., 6:267-278 (1994). Recombinant 

10 fusion proteins comprising a soluble polypeptide fused to a leucine zipper peptide are expressed in 
suitable host cells, and the soluble oligomer that forms is recovered from the culture supernatant. 

Certain leucine zipper moieties preferentially form trimers. One example is a leucine zipper 
derived from lung surfactant protein D (SPD) noted above, as described in Hoppe et aL, FEBS Letters, 
344:191 (1994) and in U.S. Patent 5,716,805, hereby incorporated by reference in their entirety. This 

1 5 lung SPD-derived leucine zipper peptide comprises the amino acid sequence Pro Asp Val Ala Ser Leu 
Arg Gin Gin Val Glu Ala Leu Gin Gly Gin Val Gin His Leu Gin Ala Ala Phe Ser Gin Tyr (SEQ ID 
NO:29). 

Another example of a leucine zipper that promotes trimerization is a peptide comprising the 

amino acid sequence Arg Met Lys Gin He Glu Asp Lys He Glu Glu He Leu Ser Lys He Tyr His He Glu 
20 Asn Glu He Ala Arg He Lys Lys Leu He Gly Glu Arg (SEQ ID NO:30), as described in U.S. Patent 

5,716,805. In one alternative embodiment, an N-terminal Asp residue is added; in another, the peptide 

lacks the N-terminal Arg residue. 

Fragments of the foregoing zipper peptides that retain the property of promoting 

oligomerization may be employed as well. Examples of such fragments include, but are not limited to, 
25 peptides lacking one or two of the N-terminal or C-terminal residues presented in the foregoing amino 

acid sequences. Leucine zippers may be derived from naturally occurring leucine zipper peptides, e.g., 

via conservative substitution(s) in the native amino acid sequence, wherein the peptide's ability to 

promote oligomerization is retained. 

Other peptides derived from naturally occurring trimeric proteins may be employed in 
30 preparing trimeric SVPH. Alternatively, synthetic peptides that promote oligomerization may be 

employed. In particular embodiments, leucine residues in a leucine zipper moiety are replaced by 

isoleucine residues. Such peptides comprising isoleucine may be referred to as isoleucine zippers, but 

are encompassed by the term "leucine zippers" as employed herein. 

35 PRODUCTION OF POLYPEPTIDES AND FRAGMENTS THEREOF 

Expression, isolation and purification of the polypeptides and fragments of the invention may 
be accomplished by any suitable technique, including but not limited to the following: 

Expression Systems 

40 The present invention also provides recombinant cloning and expression vectors containing 

DNA, as well as host cell containing the recombinant vectors. Expression vectors comprising DNA 
may be used to prepare the polypeptides or fragments of the invention encoded by the DNA. A method 
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5 for producing polypeptides comprises culturing host cells transformed with a recombinant expression 
vector encoding the polypeptide, under conditions that promote expression of the polypeptide, then 
recovering the expressed polypeptides from the culture. The skilled artisan will recognize that the 
procedure for purifying the expressed polypeptides will vary according to such factors as the type of 
host cells employed, and whether the polypeptide is membrane-bound or a soluble form that is secreted 

10 from the host cell. 

Any suitable expression system may be employed. The vectors include a DNA encoding a 
polypeptide or fragment of the invention, operably linked to suitable transcriptional or translational 
regulatory nucleotide sequences, such as those derived from a mammalian, microbial, viral, or insect 
gene. Examples of regulatory sequences include transcriptional promoters, operators, or enhancers, an 

15 mRNA ribosomal binding site, and appropriate sequences which control transcription and translation 
initiation and termination. Nucleotide sequences are operably linked when the regulatory sequence 
functionally relates to the DNA sequence. Thus, a promoter nucleotide sequence is operably linked to 
a DNA sequence if the promoter nucleotide sequence controls the transcription of the DNA sequence. 
An origin of replication that confers the ability to replicate in the desired host cells, and a selection 

20 gene by which transformants are identified, are generally incorporated into the expression vector. 

In addition, a sequence encoding an appropriate signal peptide (native or heterologous) can be 
incorporated into expression vectors. A DNA sequence for a signal peptide (secretory leader) may be 
fused in frame to the nucleic acid sequence of the invention so that the DNA is initially transcribed^and 
the mRNA translated, into a fusion protein comprising the signal peptide. A signal peptide that is 

25 functional in the intended host cells promotes extracellular secretion of the polypeptide. The signal 
peptide is cleaved from the polypeptide upon secretion of polypeptide from the cell. 

The skilled artisan will also recognize that the position(s) at which the signal peptide is 
cleaved may differ from that predicted by computer program, and may vary according to such factors 
as the type of host cells employed in expressing a recombinant polypeptide. A protein preparation may 

30 include a mixture of protein molecules having different N-terminal amino acids, resulting from 
cleavage of the signal peptide at more than one site. 

Suitable host cells for expression of polypeptides include prokaryotes, yeast or higher 
eukaryotic cells. Mammalian or insect cells are generally preferred for use as host cells. Appropriate 
cloning and expression vectors for use with bacterial, fungal, yeast, and mammalian cellular hosts are 

35 described, for example, in Pouwels et al. Cloning Vectors: A Laboratory Manual, Elsevier, New 
York, (1985). Cell-free translation systems could also be employed to produce polypeptides using 
RNAs derived from DNA constructs disclosed herein. 

Prokarvotic Systems 

40 Prokaryotes include gram-negative or gram-positive organisms. Suitable prokaryotic host 

cells for transformation include, for example, E. coli, Bacillus subtilis, Salmonella typhimurium, and 
various other species within the genera Pseudomonas, Streptomyces, and Staphylococcus. In a 
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5 prokaryotic host cell, such as E. coli, a polypeptide may include an N-terminal methionine residue to 
facilitate expression of the recombinant polypeptide in the prokaryotic host cell. The N-terminal Met 
may be cleaved from the expressed recombinant polypeptide. 

Expression vectors for use in prokaryotic host cells generally comprise one or more 
phenotypic selectable marker genes. A phenotypic selectable marker gene is, for example, a gene 

10 encoding a protein that confers antibiotic resistance or that supplies an autotrophic requirement. 
Examples of useful expression vectors for prokaryotic host cells include those derived from 
commercially available plasmids such as the cloning vector pBR322 (ATCC 37017). pBR322 contains 
genes for ampicillin and tetracycline resistance and thus provides simple means for identifying 
transformed cells. An appropriate promoter and a DNA sequence are inserted into the pBR322 vector. 

15 Other commercially available vectors include, for example, pKK223-3 (Pharmacia Fine Chemicals, 
Uppsala, Sweden) and pGEMl (Promega Biotec, Madison, WI, USA). 

Promoter sequences commonly used for recombinant prokaryotic host cell expression vectors 
include p-lactamase (penicillinase), lactose promoter system (Chang et al., Nature, 275:615 (1978); and 
Goeddel et al., Nature, 257:544 (1979)), tryptophan (trp) promoter system (Goeddel et al., NucL Acids 

20 Res., 5:4057 (1980)); and EP-A-36776 and tac promoter (Maniatis, Molecular Cloning: A Laboratory 
Manual, Cold Spring Harbor Laboratory, p. 412 (1982)). A particularly useful prokaryotic host cell 
expression system employs a phage AP L promoter and a cI857ts thermolabile repressor sequence. 
Plasmid vectors available from the American Type Culture Collection which incorporate derivatives of 
the XP L promoter include plasmid pHUB2 (resident in E. coli strain JMB9, ATCC 37092) and pPLc28 

25 (resident in E. coli RR 1 , ATCC 53082). 

Yeast Systems 

Alternatively, the polypeptides may be expressed in yeast host cells, preferably from the 
Saccharomyces genus {e.g., S. cerevisiae). Other genera of yeast, such as Pichia or Kluyveromyces, 

30 may also be employed. Yeast vectors will often contain an origin of replication sequence from a 2u 
yeast plasmid, an autonomously replicating sequence (ARS), a promoter region, sequences for 
polyadenylation, sequences for transcription termination, and a selectable marker gene. Suitable 
promoter sequences for yeast vectors include, among others, promoters for metallothionein, 3- 
phosphoglycerate kinase (Hitzeman et al., J. Biol. Chem. 255:2073 (1980)) or other glycolytic enzymes 

35 (Hess et al., /. Adv. Enzyme Reg., 7:149 (1968); and Holland et al., Biochem., 77:4900 (1978)), such as 
enolase, glyceraldehyde-3-phosphate dehydrogenase, hexokinase, pyruvate decarboxylase, 
phosphofructokinase, glucose-6-phosphate isomerase, 3-phosphogIycerate mutase, pyruvate kinase, 
triosephosphate isomerase, phospho-glucose isomerase, and glucokinase. Other suitable vectors and 
promoters for use in yeast expression are further described in Hitzeman, EPA-73,657. Another 

40 alternative is the glucose-repressible ADH2 promoter described by Russell et al., J. Biol Chem., 
255:2674 (1982) and Beier et al., Nature, 300:124 (1982). Shuttle vectors replicable in both yeast and 
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5 E. coli may be constructed by inserting DNA sequences from pBR322 for selection and replication in 
E. coli (Amp r gene and origin of replication) into the above-described yeast vectors. 

The yeast a-factor leader sequence may be employed to direct secretion of the polypeptide. 
The a-factor leader sequence is often inserted between the promoter sequence and the structural gene 
sequence. See, Kurjan et al., Cell 30:933 (1982) and Bitter et al. v Proc. Natl. Acad. Sci. USA, 87:5330 
10 (1984). Other leader sequences suitable for facilitating secretion of recombinant polypeptides from 
yeast hosts are known to those of skill in the art. A leader sequence may be modified near its 3' end to 
contain one or more restriction sites. This will facilitate fusion of the leader sequence to the structural 
gene. 

Yeast transformation protocols are known to those of skill in the art. One such protocol is 
15 described by Hinnen et al., Proc. Natl. Acad. ScL USA, 75:1929 (1978). The Hinnen et al. protocol 
selects for Trp + transformants in a selective medium, wherein the selective medium consists of 0.67% 
yeast nitrogen base, 0.5% casamino acids, 2% glucose, 10 mg/ml adenine and 20 mg/ml uracil. 

Yeast host cells transformed by vectors containing an ADH2 promoter sequence may be 
grown for inducing expression in a "rich" medium. An example of a rich medium is one consisting of 
20 1% yeast extract, 2% peptone, and 1% glucose supplemented with 80 mg/ml adenine and 80 mg/ml 
uracil. Derepression of the ADH2 promoter occurs when glucose is exhausted from the medium. 

Mammalian or Insect Systems 
Mammalian or insect host cell culture systems also may be employed to express recombinant 

25 polypeptides. Bacculovirus systems for production of heterologous proteins in insect cells are 
reviewed by Luckow and Summers, Bio/Technology 6:47 (1988). Established cell lines of mammalian 
origin also may be employed. Examples of suitable mammalian host cell lines include the COS-7 line 
of monkey kidney cells (ATCC CRL 1651) (Gluzman et al., Cell 25:175 (1981)), L cells, C127 cells, 
3T3 cells (ATCC CCL 163), Chinese hamster ovary (CHO) cells, HeLa cells, and BHK (ATCC CRL 

30 10) cell lines, and the CV1/EBNA cell line derived from the African green monkey kidney cell line 
CV1 (ATCC CCL 70) as described by McMahan et al., EM BO J. 10: 2821 (1991). 

Established methods for introducing DNA into mammalian cells have been described 
(Kaufman, R.J., Large Scale Mammalian Cell Culture, pp. 15-69 (1990)). Additional protocols using 
commercially available reagents, such as Lipofectamine lipid reagent (Gibco/BRL) or Lipofectamine- 

35 Plus lipid reagent, can be used to transfect cells (Feigner et al., Proc. Natl. Acad. Sci. USA 84:7413- 
7417 (1987)). In addition, electroporation can be used to transfect mammalian cells using conventional 
procedures, such as those in Sambrook et al., Molecular Cloning: A Laboratory Manual, 2nd ed. Vol. 
1-3, Cold Spring Harbor Laboratory Press (1989). Selection of stable transformants can be performed 
using methods known in the art, such as, for example, resistance to cytotoxic drugs. Kaufman et al., 

40 Meth. in Enzymology 755:487-511 (1990), describes several selection schemes, such as dihydrofolate 
reductase (DHFR) resistance. A suitable host strain for DHFR selection can be CHO strain DX-B11, 
which is deficient in DHFR (Urlaub and Chasin, Proc. Natl. Acad. ScL USA 77:4216-4220 (1980)). A 
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5 plasmid expressing the DHFR cDNA can be introduced into strain DX-B1 1, and only cells that contain 
the plasmid can grow in the appropriate selective media. Other examples of selectable markers that can 
be incorporated into an expression vector include cDNAs conferring resistance to antibiotics, such as 
G418 and hygromycin B. Cells harboring the vector can be selected on the basis of resistance to these 
compounds. 

10 Transcriptional and translational control sequences for mammalian host cell expression 

vectors can be excised from viral genomes. Commonly used promoter sequences and enhancer 
sequences are derived from polyoma virus, adenovirus 2, simian virus 40 (SV40), and human 
cytomegalovirus. DNA sequences derived from the SV40 viral genome, for example, SV40 origin, 
early and late promoter, enhancer, splice, and polyadenylation sites can be used to provide other 

1 5 genetic elements for expression of a structural gene sequence in a mammalian host cell. Viral early and 
late promoters are particularly useful because both are easily obtained from a viral genome as a 
fragment, which can also contain a viral origin of replication (Fiers et al., Nature 273:113 (1978); 
Kaufman et al., Meth. in Enzymology 755:487-51 1 (1990)). Smaller or larger SV40 fragments can also 
be used, provided the approximately 250 bp sequence extending from the Hind III site toward the Bgl I 

20 site located in the SV40 viral origin of replication site is included. 

Additional control sequences shown to improve expression of heterologous genes from 
mammalian expression vectors include such elements as the expression augmenting sequence element 
(EASE) derived from CHO cells (Morris et al., Animal Cell Technology, pp. 529-534 (1997)) and PCT 
Application WO 97/25420 and the tripartite leader (TPL) and VA gene RNAs from Adenovirus 2 

25 (Gingeras et al., /. Biol Chem. 257:13475-13491 (1982)). The internal ribosome entry site (IRES) 
sequences of viral origin allows dicistronic mRNAs to be translated efficiently (Oh and Sarnow, 
Current Opinion in Genetics and Development 3:295-300 (1993); Ramesh et al., Nucleic Acids 
Research 24:2697-2700 (1996)). Expression of a heterologous cDNA as part of a dicistronic mRNA 
followed by the gene for a selectable marker (e.g. DHFR) has been shown to improve transferability 

30 of the host and expression of the heterologous cDNA (Kaufman et al., Meth. in Enzymology 1 35:487 - 
511 (1990)). Exemplary expression vectors that employ dicistronic mRNAs are pTR-DC/GFP 
described by Mosser et al., Biotechniques 22:150-161 (1997), and p2A5I described by Morris et al., 
Animal Cell Technology, pp. 529-534 (1997). 

A useful high expression vector, pCAVNOT, has been described by Mosley et al., Cell 

35 59:335-348 (1989). Other expression vectors for use in mammalian host cells can be constructed as 
disclosed by Okayama and Berg, Mol. Cell. Biol. 5:280 (1983). A useful system for stable high level 
expression of mammalian cDNAs in CI 27 murine mammary epithelial cells can be constructed 
substantially as described by Cosman et al., Mol. Immunol. 23:935 (1986). A useful high expression 
vector, PMLSV N1/N4, described by Cosman et al., Nature 3/2:768 (1984), has been deposited as 

40 ATCC 39890. Additional useful mammalian expression vectors are described in EP-A-0367566, and 
in WO 91/18982, incorporated by reference herein. In yet another alternative, the vectors can be 
derived from retroviruses. 
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5 Additional useful expression vectors, pFLAG® and pDC311, can also be used. FLAG® 

technology is centered on the fusion of a low molecular weight (lkD), hydrophilic, FLAG® marker 
peptide to the N-terminus of a recombinant protein expressed by pFLAG® expression vectors. pDC31 1 
is another specialized vector used for expressing proteins in CHO cells. pDC31 1 is characterized by a 
bicistronic sequence containing the gene of interest and a dihydrofolate reductase (DHFR) gene with an 

10 internal ribosome binding site for DHFR translation, an expression augmenting sequence element 
(EASE), the human CMV promoter, a tripartite leader sequence, and a polyadenylation site. 

Regarding signal peptides that may be employed, the native signal peptide may be replaced by 
a heterologous signal peptide or leader sequence, if desired. The choice of signal peptide or leader may 
depend on factors such as the type of host cells in which the recombinant polypeptide is to be 

15 produced. To illustrate, examples of heterologous signal peptides that are functional in mammalian 
host cells include the signal sequence for interleukin-7 (TL-7) described in United States Patent 
4,965,195; the signal sequence for interleukin-2 receptor described in Cosman et al., Nature^ 312:768 
(1984); the interleukin-4 receptor signal peptide described in EP 367,566; the type I interleukin-1 
receptor signal peptide described in U.S. Patent 4,968,607; and the type II interleukin-1 receptor signal 

20 peptide described in EP 460,846. 

Purification 

The invention also includes methods of isolating and purifying the polypeptides and fragments 

thereof. 

25 

Isolation and Purification 
The "isolated" polypeptides or fragments thereof encompassed by this invention are 
polypeptides or fragments that are not in an environment identical to an environment in which it or they 
can be found in nature. The "purified" polypeptides or fragments thereof encompassed by this 
30 invention are essentially free of association with other proteins or polypeptides, for example, as a 
purification product of recombinant expression systems such as those described above or as a purified 
product from a non-recombinant source such as naturally occurring cells and/or tissues. 

In one preferred embodiment, the purification of recombinant polypeptides or fragments can 
be accomplished using fusions of polypeptides or fragments of the invention to another polypeptide to 
35 aid in the purification of polypeptides or fragments of the invention. Such fusion partners can include 
the poly-His or other antigenic identification peptides described above as well as the Fc moieties 
described previously. 

With respect to any type of host cell, as is known to the skilled artisan, procedures for 
purifying a recombinant polypeptide or fragment will vary according to such factors as the type of host 
40 cells employed and whether or not the recombinant polypeptide or fragment is secreted into the culture 
medium. 
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5 In general, the recombinant polypeptide or fragment can be isolated from the host cells if not 

secreted, or from the medium or supernatant if soluble and secreted, followed by one or more 
concentration, salting-out, ion exchange, hydrophobic interaction, affinity purification or size exclusion 
chromatography steps. As to specific ways to accomplish these steps, the culture medium first can be 
concentrated using a commercially available protein concentration filter, for example, an Amicon or 

10 Millipore Pellicon ultrafiltration unit. Following the concentration step, the concentrate can be applied 
to a purification matrix such as a gel filtration medium. Alternatively, an anion exchange resin can be 
employed, for example, a matrix or substrate having pendant diethylaminoethyl (DEAE) groups. The 
matrices can be acrylamide, agarose, dextran, cellulose or other types commonly employed in protein 
purification. Alternatively, a cation exchange step can be employed. Suitable cation exchangers 

15 include various insoluble matrices comprising sulfopropyl or carboxymethyl groups. In addition, a 
chromatofocusing step can be employed. Alternatively, a hydrophobic interaction chromatography 
step can be employed. Suitable matrices can be phenyl or octyl moieties bound to resins. In addition, 
affinity chromatography with a matrix which selectively binds the recombinant protein can be 
employed. Examples of such resins employed are lectin columns, dye columns, and metal-chelating 

20 columns. Finally, one or more re versed-phase high performance liquid chromatography (RP-HPLC) 
steps employing hydrophobic RP-HPLC media, (e.g., silica gel or polymer resin having pendant 
methyl, octyl, octyldecyl or other aliphatic groups) can be employed to further purify the polypeptides. 
Some or all of the foregoing purification steps, in various combinations, are well known and can be 
employed to provide an isolated and purified recombinant protein. 

25 It is also possible to utilize an affinity column comprising a polypeptide-binding protein of the 

invention, such as a monoclonal antibody generated against polypeptides of the invention, to affinity- 
purify expressed polypeptides. These polypeptides can be removed from an affinity column using 
conventional techniques, e.g., in a high salt elution buffer and then dialyzed into a lower salt buffer for 
use or by changing pH or other components depending on the affinity matrix utilized, or be 

30 competitively removed using the naturally occurring substrate of the affinity moiety, such as a 
polypeptide derived from the invention. 

In this aspect of the invention, polypeptide-binding proteins, such as the anti-polypeptide 
antibodies of the invention or other proteins that may interact with the polypeptide of the invention, can 
be bound to a solid phase support such as a column chromatography matrix or a similar substrate 

35 suitable for identifying, separating, or purifying cells that express polypeptides of the invention on their 
surface. Adherence of polypeptide-binding proteins of the invention to a solid phase contacting surface 
can be accomplished by any means, for example, magnetic microspheres can be coated with these 
polypeptide-binding proteins and held in the incubation vessel through a magnetic field. Suspensions 
of cell mixtures are contacted with the solid phase that has such polypeptide-binding proteins thereon. 

40 Cells having polypeptides of the invention on their surface bind to the fixed polypeptide-binding 
protein and unbound cells then are washed away. This affinity-binding method is useful for purifying, 
screening, or separating such polypeptide-expressing cells from solution. Methods of releasing 
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5 positively selected cells from the solid phase are known in the art and encompass, for example, the use 
of enzymes. Such enzymes are preferably non-toxic and non-injurious to the cells and are preferably 
directed to cleaving the cell-surface binding partner. 

Alternatively, mixtures of cells suspected of containing polypeptide-expressing cells of the 
invention first can be incubated with a biotinylated polypeptide-binding protein of the invention. 

10 Incubation periods are typically at least one hour in duration to ensure sufficient binding to 
polypeptides of the invention. The resulting mixture then is passed through a column packed with 
avidin-coated beads, whereby the high affinity of biotin for avidin provides the binding of the 
polypeptide-binding cells to the beads. Use of avidin-coated beads is known in the art. See, Berenson, 
et ah, J. Cell. Biochem., 10D:239 (1986). Wash of unbound material and the release of the bound cells 

15 is performed using conventional methods. 

The desired degree of purity depends on the intended use of the protein. A relatively high 
degree of purity is desired when the polypeptide is to be administered in vivo, for example. In such a 
case, the polypeptides are purified such that no protein bands corresponding to other proteins are 
detectable upon analysis by SDS-polyacrylamide gel electrophoresis (SDS-PAGE). It will be 

20 recognized by one skilled in the pertinent field that multiple bands corresponding to the polypeptide 
may be visualized by SDS-PAGE, due to differential glycosylation, differential post-translational 
processing, and the like. Most preferably, the polypeptide of the invention is purified to substantial 
homogeneity, as indicated by a single protein band upon analysis by SDS-PAGE. The protein band 
may be visualized by silver staining, Coomassie blue staining, or (if the protein is radiolabeled) by 

25 autoradiography. 

Assays 

The purified polypeptides of the invention (including proteins, polypeptides, fragments, 
variants, oligomers, and other forms) may be tested for the ability to bind the binding partner in any 

30 suitable assay, such as a conventional binding assay. To illustrate, the polypeptide may be labeled with 
a detectable reagent (e.g., a radionuclide, chromophore, enzyme that catalyzes a colorimetric or 
fluorometric reaction, and the like). The labeled polypeptide is contacted with cells expressing the 
binding partner. The cells then are washed to remove unbound labeled polypeptide, and the presence 
of cell-bound label is determined by a suitable technique, chosen according to the nature of the label. 

35 One example of a binding assay procedure is as follows. A recombinant expression vector 

containing binding partner cDNA is constructed using methods well known in the art. The binding 
partner comprises an N-terminal cytoplasmic domain, a transmembrane region, and a C-terminal 
extracellular domain. CV1-EBNA-1 cells in 10 cm 2 dishes are transfected with the recombinant 
expression vector. CV-l/EBNA-1 cells (ATCC CRL 10478) constitutively express EBV nuclear 

40 antigen- 1 driven from the CMV immediate-early enhancer/promoter. CV1-EBNA-1 was derived from 
the African Green Monkey kidney cell line CV-1 (ATCC CCL 70), as described by McMahan et al., 
EM BO J., 10:2821 (1991). 
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5 The transfected cells are cultured for 24 hours, and the cells in each dish then are split into a 

24- well plate. After culturing an additional 48 hours, the transfected cells (about 4 x 10 4 eel Is/ well) are 
washed with BM-NFDM, which is binding medium (RPMI 1640 containing 25 mg/ml bovine serum 
albumin, 2 mg/ml sodium azide, 20 mM Hepes pH 7.2) to which 50 mg/ml nonfat dry milk has been 
added. The cells then are incubated for 1 hour at 37°C with various concentrations of, for example, a 

10 soluble polypeptide/Fc fusion protein made as set forth above. Cells then are washed and incubated 
with a constant saturating concentration of a 125 I- mouse anti-human IgG in binding medium, with 
gentle agitation for 1 hour at 37°C. After extensive washing, cells are released via trypsinization. 

The mouse anti-human IgG employed above is directed against the Fc region of human IgG 
and can be obtained from Jackson Immunoresearch Laboratories, Inc., West Grove, PA. The antibody 

15 is radioiodtnated using the standard chloramine-T method. The antibody will bind to the Fc portion of 
any polypeptide/Fc protein that has bound to the cells. In all assays, non-specific binding of 125 I- 
antibody is assayed in the absence of the Fc fusion protein/Fc, as well as in the presence of the Fc 
fusion protein and a 200-fold molar excess of unlabeled mouse anti-human IgG antibody. 

Cell-bound 125 I-antibody is quantified on a Packard Autogamma counter. Affinity 

20 calculations (Scatchard, Ann. N.Y. Acad. Sci., 51:660 (1949)) are generated on RS/1 (BBN Software, 
Boston, MA) run on a Micro vax computer. 

Another type of suitable binding assay is a competitive binding assay. To illustrate, biological 
activity of a variant may be determined by assaying for the variant's ability to compete with the native 
protein for binding to the binding partner. 

25 Competitive binding assays can be performed by conventional methodology. Reagents that 

may be employed in competitive binding assays include radiolabeled SVPH and intact cells expressing 
the binding partner (endogenous or recombinant) on the cell surface. For example, a radiolabeled 
soluble SVPH fragment can be used to compete with a soluble SVPH variant for binding to cells 
expressing the binding partner on the surface. Instead of intact cells, one could substitute a soluble 

30 binding partner/Fc fusion protein bound to a solid phase through the interaction of Protein A or Protein 
G (on the solid phase) with the Fc moiety. Chromatography columns that contain Protein A and 
Protein G include those available from Pharmacia Biotech, Inc., Piscataway, NJ. 

Another type of competitive binding assay utilizes radiolabeled soluble binding partner, such 
as a soluble binding partner/Fc fusion protein, and intact cells expressing SVPH. Qualitative results 

35 can be obtained by competitive autoradiographic plate binding assays, while (Scatchard plots 
Scatchard, Ann. N.Y. Acad. Sci. 51:660 (1949)) may be utilized to generate quantitative results. 

USE OF SVPH NUCLEIC ACID OR OLIGONUCLEOTIDES 

In addition to being used to express polypeptides as described above, the nucleic acids of the 
40 invention, including DNA, RNA, mRNA, and oligonucleotides thereof can be used: 

as probes to identify nucleic acid encoding proteins having proteinase activity; 
to identify human chromosome number 1 or 4; 
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5 to map genes on human chromosome number 1 or 4; 

to identify genes associated with certain diseases, syndromes, or other 
conditions associated with human chromosome number 1 or 4; 

as single-stranded sense or antisense oligonucleotides, to inhibit expression of 
polypeptides encoded by the SVPH-1, SVPH-3, or SVPH-4 gene; 
10 - to detect defective genes in an individual; and 

for gene therapy. 

Probes 

The nucleotides of the invention can be used as probes to identify nucleic acid encoding 
proteins having similar activity or structure. Such uses include the use of fragments. Such fragments 
may comprise any length of contiguous nucleotides. In one embodiment, the fragment comprises at 
least about 17 contiguous nucleotides of a DNA sequence. In other embodiments, a DNA fragment 
comprises at least 30, or at least 60, contiguous nucleotides of a DNA sequence. 

Because homologs of SEQ ID NOs:l-3 and 7-11, from other mammalian species are 
contemplated herein, probes based on the human DNA sequence of SEQ ID NOs:l-3 and 7-11 may be 
used to screen cDNA libraries derived from other mammalian species, using conventional cross-species 
hybridization techniques. 

Using knowledge of the genetic code in combination with the amino acid sequences set forth 
above, sets of degenerate oligonucleotides can be prepared. Such oligonucleotides are useful as 
primers, e.g., in polymerase chain reactions (PCR), whereby DNA fragments are isolated and 
amplified. 

Chromosome Mapping 

All or a portion of the nucleic acids of SEQ ID NOs:l-3 and 7-1 1, including oligonucleotides, 
30 can be used by those skilled in the art using well-known techniques to identify human chromosomes 
and the specific locus thereof, that contains the DNA of SVPH family members. For example, all or a 
portion of SEQ ID NO:3, SEQ ID NO: 10, and SEQ ID NO: 11 can be used to identify human 
chromosome 1. In addition, all or a portion of SEQ ID NO: 1, SEQ ID NO:7, SEQ ID NO:8, and SEQ 
ID NO:9 can be used to identify human chromosome 4. Useful techniques include, but are not limited 
35 to, using the sequence or portions, including oligonucleotides, as a probe in various well-known 
techniques such as radiation hybrid mapping (high resolution), in situ hybridization to chromosome 
spreads (moderate resolution), and Southern blot hybridization to hybrid cell lines containing 
individual human chromosomes (low resolution). 

For example, chromosomes can be mapped by radiation hybrid mapping. First, PCR is 
40 performed using the Whitehead Institute/MIT Center for Genome Research Genebridge4 panel of 93 
radiation hybrids (www-genome.wi.mit.edu/ftp/distribution/human_STS_releases/july97/rhmap/ 
genebridge4.html). Primers are used which lie within a putative exon of the gene of interest and which 
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5 amplify a product from human genomic DNA, but do not amplify hamster genomic DNA. The results 
of the PCRs are converted into a data vector that is submitted to the Whitehead/MIT Radiation 
Mapping site on the internet (www-seq. wi.mit.edu). The data is scored and the chromosomal 
assignment and placement relative to known Sequence Tag Site (STS) markers on the radiation hybrid 
map is provided. The following web site provides additional information about radiation hybrid 
10 mapping: www-genome.wi.imt.edu/ftp/distribution/human_STS_releases/july97/07-97.IOT 

Identifying Associated Diseases 

As set forth below, sequences encoding SVPH-4a and SVPH-4b have been mapped by 
radiation hybrid mapping to the lp 11-13 region of chromosome 1. That region is associated with 

15 specific diseases which include but are not limited to fetal hydantoin syndrome, diphenylhydantoin 
toxicity, and pheochromocytoma. Thus, the nucleic acid of SEQ ID Nos:3, 10 and 1 1, or a fragment 
thereof, can be used by one skilled in the art using well-known techniques to analyze abnormalities 
associated with SVPH-4 genes. In addition, sequences encoding SVPH-la, SVPH-lb, and SVPH-lc 
have been mapped by radiation hybrid mapping to the 4q34 region of chromosome 4. Thus, the nucleic 

20 acid of SEQ ID Nos:l, 7, 8, and 9, or a fragment thereof, can be used by one skilled in the art using 
well-known techniques to analyze abnormalities associated with SVPH-1 genes. Similarly, a skilled 
artisan can use the nucleic acid of SEQ ID No.:2, or a fragment thereof, to analyze abnormalities 
associated with SVPH-3 genes. This enables one to distinguish conditions in which this marker is 
rearranged or deleted. In addition, nucleic acid of SEQ ID NOs:l-3 and 7-1 1 or a fragment thereof can 

25 be used as a positional marker to map other genes of unknown location. 

The DNA may be used in developing treatments for any disorder mediated (directly or 
indirectly) by defective, or insufficient amounts of, the genes corresponding to the nucleic acids of the 
invention. Disclosure herein of native nucleotide sequences permits the detection of defective genes, 
and the replacement thereof with normal genes. Defective genes may be detected in in vitro diagnostic 

30 assays, and by comparison of a native nucleotide sequence disclosed herein with that of a gene derived 
from a person suspected of harboring a defect in this gene. 

Sense- Anti sense 

Other useful fragments of the nucleic acids include antisense or sense oligonucleotides 
35 comprising a single-stranded nucleic acid sequence (either RNA or DNA) capable of binding to target 
mRNA (sense) or DNA (antisense) sequences. Antisense or sense oligonucleotides, according to the 
present invention, comprise a fragment of SEQ ID NOs:l-3 or 7-11. Such a fragment generally 
comprises at least about 14 nucleotides, preferably from about 14 to about 30 nucleotides. The ability 
to derive an antisense or a sense oligonucleotide, based upon a cDNA sequence encoding a given 
40 protein is described in, for example, Stein and Cohen, Cancer Res., 48:2659 (1988) and van der Krol et 
al., BioTechniques, 6:958 (1988). 



34 



Metalloproteinase-Disintegrin Polypeptides 



2517-USB 



Binding of antisense or sense oligonucleotides to target nucleic acid sequences results in the 
formation of duplexes that block or inhibit protein expression by one of several means, including 
enhanced degradation of the mRNA by RNAseH, inhibition of splicing, premature termination of 
transcription or translation, or by other means. The antisense oligonucleotides thus may be used to 
block expression of proteins. Antisense or sense oligonucleotides further comprise oligonucleotides 
having modified sugar-phosphodiester backbones (or other sugar linkages, such as those described in 
W09 1/06629) and wherein such sugar linkages are resistant to endogenous nucleases. Such 
oligonucleotides with resistant sugar linkages are stable in vivo (i.e., capable of resisting enzymatic 
degradation) but retain sequence specificity to be able to bind to target nucleotide sequences. 

Other examples of sense or antisense oligonucleotides include those oligonucleotides which 
are covalently linked to organic moieties, such as those described in WO 90/10448, and other moieties 
that increases affinity of the oligonucleotide for a target nucleic acid sequence, such as poly-(L- lysine). 
Further still, intercalating agents, such as ellipticine, and alkylating agents or metal complexes may be 
attached to sense or antisense oligonucleotides to modify binding specificities of the antisense or sense 
oligonucleotide for the target nucleotide sequence. 

Antisense or sense oligonucleotides may be introduced into a cell containing the target nucleic 
acid sequence by any gene transfer method, including, for example, lipofection, CaP0 4 -mediated DNA 
transfection, electroporation, or by using gene transfer vectors such as Epstein-Barr virus. 

Sense or antisense oligonucleotides also may be introduced into a cell containing the target 
nucleotide sequence by formation of a conjugate with a ligand binding molecule, as described in WO 
91/04753. Suitable ligand binding molecules include, but are not limited to, cell surface receptors, 
growth factors, other cytokines, or other ligands that bind to cell surface receptors. Preferably, 
conjugation of the ligand binding molecule does not substantially interfere with the ability of the ligand 
binding molecule to bind to its corresponding molecule or receptor, or block entry of the sense or 
antisense oligonucleotide or its conjugated version into the cell. 

Alternatively, a sense or an antisense oligonucleotide may be introduced into a cell containing 
the target nucleic acid sequence by formation of an oligonucleotide-lipid complex, as described in WO 
90/10448. The sense or antisense oligonucleotide-lipid complex is preferably dissociated within the 
cell by an endogenous lipase. 

USE OF SVPH POLYPEPTIDES AND FRAGMENTED POLYPEPTIDES 
Uses include, but are not limited to, the following: 

Purifying proteins and measuring activity thereof 
Delivery Agents 

Therapeutic and Research Reagents 

Molecular weight and Isoelectric focusing markers 

Controls for peptide fragmentation 
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5 - Identification of unknown proteins 

Preparation of Antibodies 

Purification Reagents 

Each of the polypeptides of the invention finds use as a protein purification reagent. For 
10 example, the polypeptides may be used to purify binding partner proteins. In particular embodiments, a 
polypeptide (in any form described herein that is capable of binding the binding partner) is attached to 
a solid support by conventional procedures. As one example, affinity chromatography columns 
containing functional groups that will react with functional groups on amino acid side chains of 
proteins are available (Pharmacia Biotech, Inc., Piscataway, NJ). In an alternative, a polypeptide/Fc 
1 5 protein (as discussed above) is attached to Protein A- or Protein G-containing chromatography columns 
through interaction with the Fc moiety. 

The polypeptide also finds use in purifying or identifying cells that express the binding partner 
on the cell surface. Polypeptides are bound to a solid phase such as a column chromatography matrix 
or a similar suitable substrate. For example, magnetic microspheres can be coated with the 
20 polypeptides and held in an incubation vessel through a magnetic field. Suspensions of cell mixtures 
containing the binding partner expressing cells are contacted with the solid phase having the 
polypeptides thereon. Cells expressing the binding partner on the cell surface bind to the fixed 
polypeptides, and unbound cells then are washed away. 

Alternatively, the polypeptides can be conjugated to a detectable moiety, then incubated with 
25 cells to be tested for binding partner expression. After incubation, unbound labeled matter is removed 
and the presence or absence of the detectable moiety on the cells is determined. 

In a further alternative, mixtures of cells suspected of containing cells expressing the binding 
partner are incubated with biotinylated polypeptides. Incubation periods are typically at least one hour 
in duration to ensure sufficient binding. The resulting mixture then is passed through a column packed 
30 with avidin-coated beads, whereby the high affinity of biotin for avidin provides binding of the desired 
cells to the beads. Procedures for using avidin-coated beads are known. See, Berenson, et al., J, Cell. 
Biochem., 10D:239 (1986). Washing to remove unbound material, and the release of the bound cells, 
are performed using conventional methods. 

35 Measuring Activity 

Polypeptides also find use in measuring the biological activity of the binding partner protein in 
terms of their binding affinity. The polypeptides thus may be employed by those conducting "quality 
assurance" studies, e.g., to monitor shelf life and stability of protein under different conditions. For 
example, the polypeptides may be employed in a binding affinity study to measure the biological 

40 activity of a binding partner protein that has been stored at different temperatures, or produced in 
different cell types. The proteins also may be used to determine whether biological activity is retained 
after modification of a binding partner protein (e.g., chemical modification, truncation, mutation, etc.). 
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The binding affinity of the modified binding partner protein is compared to that of an unmodified 
binding partner protein to detect any adverse impact of the modifications on biological activity of the 
binding partner. The biological activity of a binding partner protein thus can be ascertained before it is 
used in a research study, for example. 

Delivery Agents 

The polypeptides also find use as carriers for delivering agents attached thereto to cells 
bearing the binding partner (or to other cell types found to express the binding partner on the cell 
surface) in in vitro or in vivo procedures. 

Detectable (diagnostic) and therapeutic agents that may be attached to a polypeptide include, 
but are not limited to, toxins, other cytotoxic agents, drugs, radionuclides, chromophores, enzymes that 
catalyze a colorimetric or fluorometric reaction, and the like, with the particular agent being chosen 
according to the intended application. Among the toxins are ricin, abrin, diphtheria toxin, 
Pseudomonas aeruginosa exotoxin A, ribosomal inactivating proteins, mycotoxins such as 
trichothecenes, and derivatives and fragments (e.g., single chains) thereof. Radionuclides suitable for 
diagnostic use include, but are not limited to, 123 I, 13, I, ""Tc, 11 'in, and 76 Br. Examples of 
radionuclides suitable for therapeutic use are l3I I, 211 At, 77 Br, 186 Re, I88 Re, 212 Pb, 212 Bi, I0 *Pd, M Cu, and 
67 Cu. 

Such agents may be attached to the polypeptide by any suitable conventional procedure. The 
polypeptide comprises functional groups on amino acid side chains that can be reacted with functional 
groups on a desired agent to form covalent bonds, for example. Alternatively, the protein or agent may 
be derivatized to generate or attach a desired reactive functional group. The derivatization may involve 
attachment of one of the Afunctional coupling reagents available for attaching various molecules to 
proteins (Pierce Chemical Company, Rockford, Illinois). A number of techniques for radiolabeling 
proteins are known. Radionuclide metals may be attached to polypeptides by using a suitable 
bifunctional chelating agent, for example. 

Conjugates comprising polypeptides and a suitable diagnostic or therapeutic agent (preferably 
covalently linked) are thus prepared. The conjugates are administered or otherwise employed in an 
amount appropriate for the particular application. 

Therapeutic Agents 

Polypeptides of the invention may be used in developing treatments for any disorder mediated 
(directly or indirectly) by defective, or insufficient amounts of the polypeptides. These polypeptides 
may be administered to a mammal afflicted with such a disorder. 

The polypeptides may also be employed in inhibiting a biological activity of the binding 
partner, in in vitro or in vivo procedures. For example, a purified polypeptide may be used to inhibit 
binding of the binding partner to an endogenous cell surface binding partner. Biological effects that 
result from the binding of SVPH to endogenous binding partner thus are inhibited. 
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5 In addition, an SVPH binding partner may be administered to a mammal to treat a binding 

partner-mediated disorder. Such binding partner-mediated disorders include conditions caused 
(directly or indirectly) or exacerbated by the binding partner. 

Compositions of the present invention may contain a polypeptide in any form described 
herein, such as native proteins, variants, derivatives, oligomers, and biologically active fragments. In 
10 particular embodiments, the composition comprises a soluble polypeptide or an oligomer comprising 
soluble SVPH polypeptides or SVPH binding partner polypeptides. 

Compositions comprising an effective amount of a polypeptide of the present invention, in 
combination with other components such as a physiologically acceptable diluent, carrier, or excipient, 
are provided herein. The polypeptides can be formulated according to known methods used to prepare 

15 pharmaceutical^ useful compositions. They can be combined in admixture, either as the sole active 
material or with other known active materials suitable for a given indication, with pharmaceutically 
acceptable diluents (e.g., saline, Tris-HCl, acetate, and phosphate buffered solutions), preservatives 
(e.g., thimerosal, benzyl alcohol, parabens), emulsifiers, solubilizers, adjuvants and/or carriers. 
Suitable formulations for pharmaceutical compositions include those described in Remington's 

20 Pharmaceutical Sciences, 16th ed. 1980, Mack Publishing Company, Easton, PA. 

In addition, such compositions can be complexed with polyethylene glycol (PEG), metal ions, 
or incorporated into polymeric compounds such as polyacetic acid, polyglycolic acid, hydrogels, 
dextran, etc., or incorporated into liposomes, microemulsions, micelles, unilamellar or multilamellar 
vesicles, erythrocyte ghosts or spheroblasts. Such compositions will influence the physical state, 

25 solubility, stability, rate of in vivo release, and rate of in vivo clearance, and are thus chosen according 
to the intended application. 

The compositions of the invention can be administered in any suitable manner, e.g., topically, 
parenterally, or by inhalation. The term "parenteral" includes injection, e.g., by subcutaneous, 
intravenous, or intramuscular routes, also including localized administration, e.g., at a site of disease or 

30 injury. Sustained release from implants is also contemplated. One skilled in the pertinent art will 
recognize that suitable dosages will vary, depending upon such factors as the nature of the disorder to 
be treated, the patient's body weight, age, and general condition, and the route of administration. 
Preliminary doses can be determined according to animal tests, and the scaling of dosages for human 
administration is performed according to art-accepted practices. 

35 Compositions comprising nucleic acids in physiologically acceptable formulations are also 

contemplated. DNA may be formulated for injection, for example. 

Research Agents 

Another use of the polypeptide of the present invention is as a research tool for studying the 
40 biological effects that result from inhibiting binding partner/SVPH interactions on different cell types. 
Polypeptides also may be employed in in vitro assays for detecting the binding partner or SVPH or the 
interactions thereof. 
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5 Molecular Weight, Isoelectric Point Markers 

The polypeptides of the present invention can be subjected to fragmentation into smaller 
peptides by chemical and enzymatic means, and the peptide fragments so produced can be used in the 
analysis of other proteins or polypeptides. For example, such peptide fragments can be used as peptide 
molecular weight markers, peptide isoelectric point markers, or in the analysis of the degree of peptide 

10 fragmentation. Thus, the invention also includes these polypeptides and peptide fragments, as well as 
kits to aid in the determination of the apparent molecular weight and isoelectric point of an unknown 
protein and kits to assess the degree of fragmentation of an unknown protein. 

Although all methods of fragmentation are encompassed by the invention, chemical 
fragmentation is a preferred embodiment, and includes the use of cyanogen bromide to cleave under 

15 neutral or acidic conditions such that specific cleavage occurs at methionine residues (E. Gross, 
Methods in Enz., 11:238-255 (1967)). This can further include additional steps, such as a 
carboxymethylation step to convert cysteine residues to an unreactive species. Table 1 summarizes the 
fragmentation pattern of SEQ ID NOs: 12-16 following chemical cleavage with cyanogen bromide. 

Enzymatic fragmentation is another preferred embodiment, and includes the use of a protease 

20 such as Asparaginylendo-peptidase, Arginylendo-peptidase, Achromobacter protease I, Trypsin, 
Staphlococcus aureus V8 protease, Endoproteinase Asp-N, or Endoproteinase Lys-C under 
conventional conditions to result in cleavage at specific amino acid residues. Asparaginylendo- 
peptidase can cleave specifically on the carboxyl side of the asparagine residues present within the 
polypeptides of the invention. Arginylendo-peptidase can cleave specifically on the carboxyl side of 

25 the arginine residues present within these polypeptides. Achromobacter protease I can cleave 
specifically on the carboxyl side of the lysine residues present within the polypeptides (Sakiyama and 
Nakat, U.S. Patent No. 5,248,599; T. Masaki et al., Biochim. Biophys. Acta, 660:44-50 (1981); T. 
Masaki et al., Biochim. Biophys. Acta, 660:51-55 (1981)). Trypsin can cleave specifically on the 
carboxyl side of the arginine and lysine residues present within polypeptides of the invention. 

30 Enzymatic fragmentation may also occur with a protease that cleaves at multiple amino acid residues. 
For example, Staphlococcus aureus V8 protease can cleave specifically on the carboxyl side of the 
aspartic and glutamic acid residues present within polypeptides (D. W. Cleveland, J. Biol. Chem., 
5:1102-1106 (1977)). Endoproteinase Asp-N can cleave specifically on the amino side of the 
asparagine residues present within polypeptides. Endoproteinase Lys-C can cleave specifically on the 

35 carboxyl side of the lysine residues present within polypeptides of the invention. Other enzymatic and 
chemical treatments can likewise be used to specifically fragment these polypeptides into a unique set 
of specific peptides. 

Of course, the peptides and fragments of the polypeptides of the invention can also be 
produced by conventional recombinant processes and synthetic processes well known in the art. With 
40 regard to recombinant processes, the polypeptides and peptide fragments encompassed by invention 
can have variable molecular weights, depending upon the host cell in which they are expressed. 
Glycosylation of polypeptides and peptide fragments of the invention in various cell types can result in 
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5 variations of the molecular weight of these pieces, depending upon the extent of modification. The size 
of these pieces can be most heterogeneous with fragments of polypeptide derived from the extracellular 
portion of the polypeptide. Consistent polypeptides and peptide fragments can be obtained by using 
polypeptides derived entirely from the transmembrane and cytoplasmic regions, pretreating with N- 
glycanase to remove glycosylation, or expressing the polypeptides in bacterial hosts. 

10 The molecular weight of these polypeptides can also be varied by fusing additional peptide 

sequences to both the amino and carboxyl terminal ends of polypeptides of the invention. Fusions of 
additional peptide sequences at the amino and carboxyl terminal ends of polypeptides of the invention 
can be used to enhance expression of these polypeptides or aid in the purification of the protein. In 
addition, fusions of additional peptide sequences at the amino and carboxyl terminal ends of 

15 polypeptides of the invention will alter some, but usually not all, of the fragmented peptides of the 
polypeptides generated by enzymatic or chemical treatment. Of course, mutations can be introduced 
into polypeptides of the invention using routine and known techniques of molecular biology. For 
example, a mutation can be designed so as to eliminate a site of proteolytic cleavage by a specific 
enzyme or a site of cleavage by a specific chemically induced fragmentation procedure. The 

20 elimination of the site will alter the peptide fingerprint of polypeptides of the invention upon 
fragmentation with the specific enzyme or chemical procedure. 

When the invention relates to the use of fragmented peptide molecular weight markers, those 
markers are preferably at least 10 amino acids in size. More preferably, these fragmented peptide 
molecular weight markers are between 10 and 100 amino acids in size. Even more preferable are 

25 fragmented peptide molecular weight markers between 10 and 50 amino acids in size and especially 
between 10 and 35 amino acids in size. Most preferable are fragmented peptide molecular weight 
markers between 10 and 20 amino acids in size. 

Because the unique amino acid sequence of each fragment specifies a molecular weight, these 
fragments can thereafter serve as molecular weight markers using such analysis techniques to assist in 

30 the determination of the molecular weight of an unknown protein, polypeptides or fragments thereof. 
The molecular weight markers of the invention serve particularly well as molecular weight markers for 
the estimation of the apparent molecular weight of proteins that have similar apparent molecular 
weights and, consequently, allow increased accuracy in the determination of apparent molecular weight 
of proteins. 

35 Among the methods for determining molecular weight are sedimentation, gel electrophoresis, 

chromatography, and mass spectrometry. A particularly preferred embodiment is denaturing 
polyacrylamide gel electrophoresis (U. K. Laemmli, Nature, 227:680-685 (1970)). Conventionally, the 
method uses two separate lanes of a gel containing sodium dodecyl sulfate and a concentration of 
acrylamide between 6-20%. The ability to simultaneously resolve the marker and the sample under 

40 identical conditions allows for increased accuracy. It is understood, of course, that many different 
techniques can be used for the determination of the molecular weight of an unknown protein using 
polypeptides of the invention, and that this embodiment in no way limits the scope of the invention. 
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5 In addition, each unglycosylated polypeptide or fragment thereof has a pi that is intrinsically 

determined by its unique amino acid sequence (which pi can be estimated by the skilled artisan using 
any of the computer programs designed to predict pi values currently available, calculated using any 
well-known amino acid pKa table, or measured empirically). Therefore these polypeptides and 
fragments thereof can serve as specific markers to assist in the determination of the isoelectric point of 

10 an unknown protein, polypeptide, or fragmented peptide using techniques such as isoelectric focusing. 
These polypeptide or fragmented peptide markers serve particularly well for the estimation of apparent 
isoelectric points of unknown proteins that have apparent isoelectric points close to that of the 
polypeptide or fragmented peptide markers of the invention. 

The technique of isoelectric focusing can be further combined with other techniques such as 

15 gel electrophoresis to simultaneously separate a protein on the basis of molecular weight and charge. 
The ability to simultaneously resolve these polypeptide or fragmented peptide markers and the 
unknown protein under identical conditions allows for increased accuracy in the determination of the 
apparent isoelectric point of the unknown protein. This is of particular interest in techniques, such as 
two dimensional electrophoresis (T.D. Brock and M.T. Madigan, Biology of Microorganisms 76-77, 

20 Prentice Hall, 6th ed. (1991)), where the nature of the procedure dictates that any markers should be 
resolved simultaneously with the unknown protein. In addition, with such methods, these polypeptides 
and fragmented peptides thereof can assist in the determination of both the isoelectric point and 
molecular weight of an unknown protein or fragmented peptide. 

Polypeptides and fragmented peptides can be visualized using two different methods that 

25 allow a discrimination between the unknown protein and the molecular weight markers. In one 
embodiment, the polypeptide and fragmented peptide molecular weight markers of the invention can be 
visualized using antibodies generated against these markers and conventional immunoblotting 
techniques. This detection is performed under conventional conditions that do not result in the 
detection of the unknown protein. It is understood that it may not be possible to generate antibodies 

30 against all polypeptide fragments of the invention, since small peptides may not contain immunogenic 
epitopes. It is further understood that not all antibodies will work in this assay; however, those 
antibodies which are able to bind polypeptides and fragments of the invention can be readily 
determined using conventional techniques. 

The unknown protein is also visualized by using a conventional staining procedure. The 

35 molar excess of unknown protein to polypeptide or fragmented peptide molecular weight markers of 
the invention is such that the conventional staining procedure predominantly detects the unknown 
protein. The level of these polypeptide or fragmented peptide molecular weight markers is such as to 
allow little or no detection of these markers by the conventional staining method. The preferred molar 
excess of unknown protein to polypeptide molecular weight markers of the invention is between 2 and 

40 100,000 fold. More preferably, the preferred molar excess of unknown protein to these polypeptide 
molecular weight markers is between 10 and 10,000 fold and especially between 100 and 1,000 fold. 

■v. 
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5 It is understood of course that many techniques can be used for the determination and 

detection of molecular weight and isoelectric point of an unknown protein, polypeptides, and 
fragmented peptides thereof using these polypeptide molecular weight markers and peptide fragments 
thereof and that these embodiments in no way limit the scope of the invention. 

In another embodiment, the analysis of the progressive fragmentation of the polypeptides of 

10 the invention into specific peptides (D. W. Cleveland et al., 7. Biol Chem. 252:1102-1106 (1977)), 
such as by altering the time or temperature of the fragmentation reaction, can be used as a control for 
the extent of cleavage of an unknown protein. For example, cleavage of the same amount of 
polypeptide and unknown protein under identical conditions can allow for a direct comparison of the 
extent of fragmentation. Conditions that result in the complete fragmentation of the polypeptide can 

15 also result in complete fragmentation of the unknown protein. 

As to the specific use of the polypeptides and fragmented peptides of the invention as 
molecular weight markers, the fragmentation of the polypeptide of SEQ ID NOs:4-6 and 12-16 with 
cyanogen bromide in the absence of glycosylation generates a unique set of fragmented peptide 
molecular weight markers with molecular weights as set forth in Table 1. 

20 Table 1 . Molecular Weights of Peptide Fragments Generated by Cyanogen Bromide Digest 
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149.2 
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277.4 
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374.5 
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23,353.2 
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5 The distribution of methionine residues determines the number of amino acids in each peptide 

and the unique amino acid composition of each peptide determines its molecular weight. Where 
fragments are used, there is increased accuracy in determining molecular weight over the range of the 
molecular weights of the fragment. 

In addition, the preferred purified polypeptides of the invention (SEQ ID NOs:4-6 and 12-16) 

10 have calculated molecular weights of approximately 4,199; 13,938; 55,209; 86,983; 89459; 92,781; 
88,923; and 87,990 Daltons, respectively. Thus, where an intact protein is used, the use of these 
polypeptide molecular weight markers allows increased accuracy in the determination of apparent 
molecular weight of proteins that have apparent molecular weights close to these weights. 

Finally, as to the kits that are encompassed by the invention, the constituents of such kits can 

15 be varied, but typically contain the polypeptide and fragmented peptide molecular weight markers. 
Also, such kits can contain the polypeptides wherein a site necessary for fragmentation has been 
removed. Furthermore, the kits can contain reagents for the specific cleavage of the polypeptide and 
the unknown protein by chemical or enzymatic cleavage. Kits can further contain antibodies directed 
against polypeptides or fragments thereof of the invention. 

20 

Identification of Unknown Proteins 

As set forth above, a polypeptide or peptide fingerprint can be entered into or compared to a 
database of known proteins to assist in the identification of the unknown protein using mass 
spectrometry (WJ. Henzel et al., Proc. Natl. Acad. Sci. USA, 90:5011-5015 (1993); D. Fenyo et al., 

25 Electrophoresis, 19:998-1005 (1998)). A variety of computer software programs to facilitate these 
comparisons are accessible via the Internet, such as Protein Prospector (Internet site: 
prospector.uscf.edu), Multildent (Internet site: www.expasy.ch/sprot/multiident.html), PeptideSearch 
(Internet site:www.mann.embl-heiedelberg.de...deSearch/FR_PeptideSearchForm.html), and ProFound 
(Internet site:www.chait-sgi.rockefeller.edu/cgi-bin/prot-id-frag.html). These programs allow the user 

30 to specify the cleavage agent and the molecular weights of the fragmented peptides within a designated 
tolerance. The programs compare observed molecular weights to predicted peptide molecular weights 
derived from sequence databases to assist in determining the identity of the unknown protein. 

In addition, a polypeptide or peptide digest can be sequenced using tandem mass spectrometry 
(MS/MS) and the resulting sequence searched against databases (J.K. Eng, et al., J. Am. Soc. Mass 

35 Spec, 5:976-989 (1994); M. Mann and M. Wilm, Anal. Chem., 66:4390-4399 (1994); J.A. Taylor and 
R.S. Johnson, Rapid Comm. Mass Spec, 11:1067-1075 (1997)). Searching programs that can be used 
in this process exist on the Internet, such as Lutefisk 97 (Internet site: 
www.lsbc.com:70/Lutefisk97.html), and the Protein Prospector, Peptide Search and ProFound 
programs described above. 

40 Therefore, adding the sequence of a gene and its predicted protein sequence and peptide 

fragments to a sequence database can aid in the identification of unknown proteins using mass 
spectrometry. 
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Antibodies 

Antibodies that are immunoreactive with the polypeptides of the invention are provided 
herein. Such antibodies specifically bind to the polypeptides via the antigen-binding sites of the 
antibody (as opposed to non-specific binding). Thus, the polypeptides, fragments, variants, fusion 

10 proteins, etc., as set forth above may be employed as "immunogens" in producing antibodies 
immunoreactive therewith. More specifically, the polypeptides, fragment, variants, fusion proteins, etc. 
contain antigenic determinants or epitopes that elicit the formation of antibodies. 

These antigenic determinants or epitopes can be either linear or conformational 
(discontinuous). Linear epitopes are composed of a single section of amino acids of the polypeptide, 

15 while conformational or discontinuous epitopes are composed of amino acids sections from different 
regions of the polypeptide chain that are brought into close proximity upon protein folding (C. A. 
Janeway, Jr. and P. Travers, Immuno Biology 3:9, Garland Publishing Inc., 2nd ed. (1996)). Because 
folded proteins have complex surfaces, the number of epitopes available is quite numerous; however, 
due to the conformation of the protein and steric hinderances, the number of antibodies that actually 

20 bind to the epitopes is less than the number of available epitopes (C. A. Janeway, Jr. and P. Travers, 
Immuno Biology , 2:14, Garland Publishing Inc., 2nd ed. (1996)). Epitopes may be identified by any of 
the methods known in the art. 

Thus, one aspect of the present invention relates to the antigenic epitopes of the polypeptides 
of the invention. Such epitopes are useful for raising antibodies, in particular monoclonal antibodies, 

25 as described in more detail below. Additionally, epitopes from the polypeptides of the invention can be 
used as research reagents, in assays, and to purify specific binding antibodies from substances such as 
polyclonal sera or supernatants from cultured hybridomas. Such epitopes or variants thereof can be 
produced using techniques well known in the art such as solid-phase synthesis, chemical or enzymatic 
cleavage of a polypeptide, or using recombinant DNA technology. 

30 As to the antibodies that can be elicited by the epitopes of the polypeptides of the invention, 

whether the epitopes have been isolated or remain part of the polypeptides, both polyclonal and 
monoclonal antibodies may be prepared by conventional techniques. See, for example, Monoclonal 
Antibodies, Hybridomas: A New Dimension in Biological Analyses, Kennet et al. (eds.), Plenum Press, 
New York (1980); and Antibodies: A Laboratory Manual, Harlow and Land (eds.), Cold Spring 

35 Harbor Laboratory Press, Cold Spring Harbor, NY, (1988). 

Hybridoma cell lines that produce monoclonal antibodies specific for the polypeptides of the 
invention are also contemplated herein. Such hybridomas may be produced and identified by 
conventional techniques. One method for producing such a hybridoma cell line comprises immunizing 
an animal with a polypeptide; harvesting spleen cells from the immunized animal; fusing said spleen 

40 cells to a myeloma cell line, thereby generating hybridoma cells; and identifying a hybridoma cell line 
that produces a monoclonal antibody that binds the polypeptide. The monoclonal antibodies may be 
recovered by conventional techniques. 
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5 The monoclonal antibodies of the present invention include chimeric antibodies, e.g., 

humanized versions of murine monoclonal antibodies. Such humanized antibodies may be prepared by 
known techniques and offer the advantage of reduced immunogenicity when the antibodies are 
administered to humans. In one embodiment, a humanized monoclonal antibody comprises the 
variable region of a murine antibody (or just the antigen binding site thereof) and a constant region 

10 derived from a human antibody. Alternatively, a humanized antibody fragment may comprise the 
antigen binding site of a murine monoclonal antibody and a variable region fragment (lacking the 
antigen-binding site) derived from a human antibody. Procedures for the production of chimeric and 
further engineered monoclonal antibodies include those described in Riechmann et al., Nature, 332:323 
(1988); Liu et al., PNAS, 84:3439 (1987); Larrick et al., Bio/Technology, 7:934 (1989), and Winter and 

15 Harris, TIPS, 14:139 (May 1993). Procedures to generate antibodies transgenically can be found in GB 
2,272,440, US Patent Nos. 5,569,825 and 5,545,806 and related patents claiming priority therefrom, all 
of which are incorporated by reference herein. 

Antigen-binding fragments of the antibodies, which may be produced by conventional 
techniques, are also encompassed by the present invention. Examples of such fragments include, but 

20 are not limited to, Fab and F(ab') 2 fragments. Antibody fragments and derivatives produced by genetic 
engineering techniques are also provided. 

In one embodiment, the antibodies are specific for the polypeptides of the present invention 
and do not cross-react with other proteins. Screening procedures by which such antibodies may be 
identified are well known, and may involve immunoaffinity chromatography, for example. 

25 

Uses Thereof 

The antibodies of the invention can be used in assays to detect the presence of the 
polypeptides or fragments of the invention, either in vitro or in vivo. The antibodies also may be 
employed in purifying polypeptides or fragments of the invention by immunoaffinity chromatography. 

30 Those antibodies that additionally can block binding of the polypeptides of the invention to 

the binding partner may be used to inhibit a biological activity that results from such binding. Such 
blocking antibodies may be identified using any suitable assay procedure, such as by testing antibodies 
for the ability to inhibit binding of SVPH to certain cells expressing the binding partner. Alternatively, 
blocking antibodies may be identified in assays for the ability to inhibit a biological effect that results 

35 from binding of SVPH to target cells. Antibodies may be assayed for the ability to inhibit 
SVPH-mediated cell lysis, for example. 

Such an antibody may be employed in an in vitro procedure, or administered in vivo to inhibit 
a biological activity mediated by the entity that generated the antibody. Disorders caused or 
exacerbated (directly or indirectly) by the interaction of SVPH with cell surface binding partner thus 

40 may be treated. A therapeutic method involves in vivo administration of a blocking antibody to a 
mammal in an amount effective in inhibiting an SVPH-binding partner-mediated biological activity. 
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5 Monoclonal antibodies are generally preferred for use in such therapeutic methods. In one 
embodiment, an antigen-binding antibody fragment is employed. 

Antibodies may be screened for agonistic (i.e., ligand-mimicking) properties. Such 
antibodies, upon binding to cell surface binging partner, induce biological effects (e.g., transduction of 
biological signals) similar to the biological effects induced when SVPH binds to cell surface binding 
10 partner. 

Compositions comprising an antibody that is directed against SVPH or SVPH binding partner, 
and a physiologically acceptable diluent, excipient, or carrier, are provided herein. Suitable 
components of such compositions are as described above for compositions containing SVPH or SVPH 
binding partner proteins. 

15 Also provided herein are conjugates comprising a detectable (e.g., diagnostic) or therapeutic 

agent, attached to the antibody. 

The following examples are provided to further illustrate particular embodiments of the 
invention, and are not to be construed as limiting the scope of the present invention. 

20 EXAMPLE 1: Isolation of SVPH Nucleic Acids 

A search of the GenBank DNA sequence database revealed two ESTs that share homology 
with ADAM20 and ADAM21. X85598 showed similarity to the Cys-rich region of ADAM20, while 
AI2 14466 showed similarity to the same region in ADAM21. Both ESTs were derived from testis 
mRNA. 

25 X85598 and AI2 14466 were used to design primers, which were used to screen a human testis 

library. SVPH-1 clones were isolated by screening a human testis library (Clonetech cat no. HL3024a), 
(Cerretti et al., Proc. Natl. Acad. Sci. USA, 83:3223-3227 (1986)), at 42°C and washing at 42°C in 2x 
SSC/0.1% SDS using 32 P-labeled deoxyoligonucleotides (5-CACCTAAGGTGTTCAATTCTTTG-3' 
(SEQ ID NO: 17), S'-CAAATACTGCAAGTGAGACTTGC-S' (SEQ ID NO: 18), 

30 5'-TGCAC AACTACGTGTGGTGTACCC-3' (SEQ ID NO: 19), and 
5 , -GAGCCACTGCAATTGAAAAAGTGCCC-3' (SEQ ID NO:20). 

SVPH-4 clones were isolated under the same conditions using 32 P-labeled deoxyoligonucleotides 
(AATGATGCTCTTGCATGGTCG (SEQ ID NO:21), CTTTCACGGAGCCCATGTAGTTGCAG 
(SEQ ID NO:22), and TGAAGGAGAAAACGCGCAGATGTCGG (SEQ ID NO:23). DNAs from 
35 positively hybridizing phages were purified and characterized by restriction endonuclease mapping, 
Southern blot analysis, and DNA sequencing. 

EXAMPLE 2: DNA Sequence Analysis of SVPH 

SVPH-lc has an open reading frame of 820 amino acids (GenBank accession number 
AF171929) that encodes all of the ADAMs domains, inluding a signal sequence, pro-domain with a 
40 Cys switch, catalytic domain with a zinc-binding motif and a Met-turn, disintegrin domain, cysteine- 
rich domain, a transmembrane domain, and a cytoplasmic domain. However, SVPH-lc (as well as 
SVPH-la and SVPH-lb) has a His residue (His 333) instead of a Glu residue in the zinc -binding motif 



46 



Metalloproteinase-Disintegrin Polypeptides 



2517-USB 



5 that may affect catalytic activity. The Glu residue binds a water molecule via hydrogen binding and is 
required for enzymatic activity (Stacker, W. et al., Protein ScL, 4:823-840 (1995)). SVPH-la and 
SVPH-lb represent alternative forms of SVPH-lc with differences in the cytoplasmic domain. 
SVPH-la has a deletion of 54 amino acids resulting in a protein of 766 amino acids (GenBank 
accession number AF171930), while SVPH-lb_has a divergent 38 amino acid C-terminus resulting in 

10 a protein with 787 amino acids (GenBank accession number AF171931). These three forms of 
SVPH-1 encode cytoplasmic domains of 121, 67, and 88 amino acids, respectively. An unusual feature 
of the cytoplasmic domain of SVPH-lc is the sequence SQSQPPLMP (SEQ ID NO:32), which is 
repeated nine times. A search of GenBank did not find this repeat sequence in the database. 

SVPH-4a has an open reading frame of 790 amino acids (GenBank accession number 

15 AF171932} with all of the domains found in ADAMs. Unlike the SVPH-1 clones, SVPH-4a and 
SVPH-4b each has a consensus zinc-binding motif in the catalytic domain. One cDNA, presumably 
from an alternative RNA splicing event, deletes nine amino acids in the cytoplasmic domain and has 
been designated SVPH-4b (GenBank accession number AF171933). Interestingly, SVPH-4a and 
SVPH-4b contain a repeat sequence, QEESK(T/A)KTG (SEQ ID NO:33), in the cytoplasmic domain, 

20 which was not found in GenBank. 

As noted above, SVPH-la, SVPH-lb, and SVPH-lc diverge from the consensus zinc-binding 
cluster (HEXXHXXGXXHD) (SEQ ID NO:31) in the catalytic domain with a Glu to His change at 
position 333. To analyze these proteins further, DNA and protein sequence multiple alignments of all 
known mammalian ADAMs (www.med.virginia.edu/-jag6n/adams.html) were produced using the 

25 PBLEUP program from the Wisconsin Package (Wisconsin Package 10.1, Genetics Computer Group, 
Madison, WI). Protein multiple alignments were generated using the modified PAM scoring matrix of 
Gribskov and Burgess (Gribskov, M. et al., Nucleic Acids Res., 14:6745-6763 (1986)) provided in the 
Wisconsin Package, with gap-open and gap-extend penalties of 30 and 1, respectively. Nucleic acid 
multiple alignments were generated using a scoring matrix with A, C, G, T matches scoring unity, 

30 mismatches scoring zero, and gap-open and gap-extend penalties of 5 and 1 respectively. Unrooted 
maximum parsimony trees were estimated by the Wisconsin Package implementation of PAUP 
(version 4.0), starting from multiple alignments produced by PILEUP. PAUP parameters were set to 
use accelerated transformation character-state optimization with unordered, equally weighted 
characters. 

35 This alignment was used to infer a maximum parsimony phylogeny (Fig. 2). Due to the large 

number of taxa involved, the phylogeny was inferred using a heuristic tree search, which does not 
perform an exhaustive search of all possible tree topologies. Examination of the phylogenetic tree 
revealed an interesting pattern with respect to the presence of a zinc-binding motif. The ADAM 
sequences can be divided into two well-separated regions of the phylogeny, as marked by the arrow in 

40 Fig. 2. ADAM family members that contain a consensus zinc-binding site (HEXXHXXGXXHD) 
(SEQ ID NO:31) are clustered in one large group (heavy lines). Closely related members that do not 
have a consensus zinc binding motif (ADAMs 4, 6, 7, 1 1, 22, 23, and SVPH-1) presumably arose from 
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5 a catalytically active ancestor, as many of them encode remnants of the zinc-binding motif. For 
example, ADAM4, ADAM7, and SVPH-1 all possess the three His residues and the Asp after the third 
conserved His. Finally, the corresponding region in ADAMs 2, 3, 5, 18 and 27 is quite distinct. As 
these sequences form clusters quite divergent from the zinc-binding site-containing ADAMs, it is most 
likely that the zinc-binding site arose once in the common ancestor to the ADAMs and was lost in 
10 those lineages which do not possess a zinc -binding site (denoted by an 'X' in Fig. 2). 

EXAMPLE 3: Chromosome Mapping of SVPH 

Radiation hybrid mapping (Walter, et al., Nat. Genet, 7: 22-28 (1994)) was done using the 
GeneBridge 4 radiation-hybrid mapping panel (Research Genetics, Huntsville, AL). The panel was 

15 screened with specific primer pairs for SVPH-1 

(sense: 5'-TCGATAATGCATGAAGGCAACCCACC-3' (SEQ ID NO:24) and 

antisense: 5'-CAAGTCTCACTTGCAGTATTTGCGCC-3* (SEQ ID NO:25), and SVPH-4 

(sense; 5-GCCACTGCATGTATGGGTG-3' (SEQ ID NO:26) and antisense: 5- 

GACACTCTTTGCTTTGGGTCG-3* (SEQ ID NO:27) which generated products of 298 and 263 bp, 

20 respectively. PCR products were subjected to Southern blot analysis using an internal oligonucleotide 
probe specific for each gene. Data from two independent PCR screenings for each primer pair were 
scored against STS markers from the Whitehead Institute/MIT Center for Genome Research database 
using the statistical program RHMAPPER. LOD scores were >3.0 in all cases. 

SVPH-la, SVPH-lb, and SVPH-lc were mapped to chromosome 4q34, 1.51 cR distal from 

25 AFM312WG1. The sequential order of known markers relative to SVPH-1 on the Whitehead 
framework map was D4S1545, PDGH (Hydroxyprostaglandin Dehyrogenase 15)/SVPH-1/WI- 
21773/GPM6A (Glycoprotein M6A). This region is syntenic to mouse chromosome 8. SVPH-4a and 
SVPH-4b were mapped to chromosome lpll-13, 1.65 cR distal to D1S453. The sequential order of 
markers relative to SVPH-4 on the Whitehead framework map was CD2 (thymocyte surface 

30 antigen)/SVPH-4/3_HSD2 (3 _ Hydroxy-5-ene Steroid Dehydrogenase Type II). This region is 
syntenic to mouse chromosome 3. 

EXAMPLE 4: Tissue Distribution of SVPH 

Northern blot analysis was used to determine the tissue distribution of SVPH-1 and SVPH-4. 
35 Northern blots were purchased from Clontech (catalog number 7760-1, 7759-1, 7755-1, 7750-1). Each 
lane contained approximately 2 \ig of the indicated poly A + RNA. The blots were treated with Stark's 
buffer (50% formamide, 50mM KPO4, 5 x SSC, 1% SDS, 5X Denhardt's, 0.05% sarcosyl, 300mg/ml 

32 

salmon sperm DNA) at 63°C for at least lh and then probed with P-labeled riboprobes in Stark's 
buffer at 63°C, overnight (Cosman et al., Nature, 312:768-771 (1984)). Blots were then sequentially 
40 washed to high stringency (0.1 x SSC, 0.1% SDS, 63°C) and exposed to film. Films were developed in 
an automated x-ray film processor. SVPH-1 (nt 1068 to 1786 of SEQ ID NOs:7-9) and SVPH-4 (nt 
1343 to 1779 of SEQ ID NOs: 10-11) anti-sense riboprobes were prepared by in vitro transcription from 
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5 a T7 RNA promoter with a commercially available kit (MAXIscript, Ambion, Inc., Austin, TX) using 

[a- 32 P]-UTP as the labeled nucleotide. 

As indicated in Figure 1, both SVPH-1 and SVPH-4 were specifically expressed in testes with 
a single mRNA species of approximately 3.0 kb. No signals were detected in the other RNA samples. 

10 EXAMPLE 5: Monoclonal Antibodies 

This example illustrates a method for preparing monoclonal antibodies that bind an SVPH-1, 
SVPH-la, SVPH-lb, SVPH-lc, SVPH-4, SVPH-4a or SVPH-4b polypeptide. Suitable immunogens 
that may be employed in generating such antibodies include, but are not limited to, purified SVPH-1, 
SVPH-la, SVPH-lb, SVPH-lc, SVPH-4, SVPH-4a or SVPH-4b polypeptide or an immunogenic 

15 fragment thereof such as the extracellular domain, or fusion proteins containing SVPH-1, SVPH-la, 
SVPH-lb, SVPH-lc, SVPH-4, SVPH-4a or SVPH-4b (e.g., a soluble SVPH-l/Fc fusion protein). 

Purified SVPH-1, SVPH-la, SVPH-lb, SVPH-lc, SVPH-4, SVPH-4a or SVPH-4b can be 
used to generate monoclonal antibodies immunoreactive therewith, using conventional techniques such 
as those described in U.S. Patent 4,411,993. Briefly, mice are immunized with SVPH-1, SVPH-la, 

20 SVPH-lb, SVPH-lc, SVPH-4, SVPH-4a or SVPH-4b immunogen emulsified in complete Freund's 
adjuvant, and injected in amounts ranging from 10-100 Dg subcutaneously or intraperitoneal^. Ten to 
twelve days later, the immunized animals are boosted with additional immunogen emulsified in 
incomplete Freund's adjuvant. Mice are periodically boosted thereafter on a weekly to bi-weekly 
immunization schedule. Serum samples are periodically taken by retro-orbital bleeding or tail-tip 

25 excision to test for SVPH-1, SVPH-la, SVPH-lb, SVPH-lc, SVPH-4, SVPH-4a or SVPH-4b 
antibodies by dot blot assay, ELISA (Enzyme-Linked Immunosorbent Assay) or inhibition of binding 
partner binding. 

Following detection of an appropriate antibody titer, positive animals are provided one last 
intravenous injection of SVPH-1, SVPH-la, SVPH-lb, SVPH-lc, SVPH-4, SVPH-4a or SVPH-4b in 

30 saline. Three to four days later, the animals are sacrificed, spleen cells harvested, and spleen cells are 
fused to a murine myeloma cell line, e.g., NS1 or preferably P3x63Ag8.653 (ATCC CRL 1580). 
Fusions generate hybridoma cells, which are plated in multiple microtiter plates in a HAT 
(hypoxanthine, aminopterin and thymidine) selective medium to inhibit proliferation of non-fused cells, 
myeloma hybrids, and spleen cell hybrids. 

35 The hybridoma cells are screened by ELISA for reactivity against purified SVPH-1, SVPH-la, 

SVPH-lb, SVPH-lc, SVPH-4, SVPH-4a or SVPH-4b by adaptations of the techniques disclosed in 
Engvall et al., Immunochem. 8:871, 1971 and in U.S. Patent 4,703,004. A preferred screening 
technique is the antibody capture technique described in Beckmann et al., J. Immunol 144:4212, 
(1990). Positive hybridoma cells can be injected intraperitoneal ly into syngeneic BALB/c mice to 

40 produce ascites containing high concentrations of anti-SVPH-1, SVPH-la, SVPH-lb, SVPH-lc, 
SVPH-4, SVPH-4a or SVPH-4b monoclonal antibodies. Alternatively, hybridoma cells can be grown 
in vitro in flasks or roller bottles by various techniques. Monoclonal antibodies produced in mouse 
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5 ascites can be purified by ammonium sulfate precipitation, followed by gel exclusion chromatography. 
Alternatively, affinity chromatography based upon binding of antibody to Protein A or Protein G can 
also be used, as can affinity chromatography based upon binding to SVPH-1, SVPH-la, SVPH-lb, 
SVPH-lc, SVPH-4, SVPH-4a or SVPH-4b. 

10 EXAMPLE 6; Binding Assay 

Full length SVPH-1, SVPH-la, SVPH-lb, SVPH-lc, SVPH-4, SVPH-4a or SVPH-4b is 
expressed and tested for the ability to bind its binding partner. The binding assay is conducted as 
follows. 

A fusion protein comprising a leucine zipper peptide fused to the N-terminus of a soluble 

15 binding partner polypeptide (LZ-binding partner) is employed in the assay. An expression construct is 
prepared, essentially as described for preparation of the FLAG®-binding partner expression construct 
in Wiley et al., Immunity, 3:673-682, (1995), which is hereby incorporated by reference, except that 
DNA encoding the FLAG® peptide is replaced with a sequence encoding a modified leucine zipper 
that allows for trimerization. The construct, in expression vector pDC409, encodes a leader sequence 

20 derived from human cytomegalovirus, followed by the leucine zipper moiety fused to the N-terminus 
of a soluble binding partner polypeptide. The LZ-binding partner is expressed in CHO cells, and can 
be purified from the culture supernatant. 

The expression vector designated pDC409 is a mammalian expression vector derived from the 
pDC406 vector described in McMahan et al., EMBO J. 10:2821-2832, (1991), which is hereby 

25 incorporated by reference. Features added to pDC409 (compared to pDC406) include additional 
unique restriction sites in the multiple cloning site (mcs); three stop codons (one in each reading frame) 
positioned downstream of the mcs; and a T7 polymerase promoter, downstream of the mcs, that 
facilitates sequencing of DNA inserted into the mcs. 

For expression of full length human SVPH-1, SVPH-la, SVPH-lb, SVPH-lc, SVPH-4, 

30 SVPH-4a or SVPH-4b protein, the entire coding region {i.e., the DNA sequence presented in SEQ ID 
NO:l, SEQ ID NO:3, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO: 10, or SEQ ID 
NO: 11) is amplified by polymerase chain reaction (PCR). The isolated and amplified DNA is inserted 
into the expression vector pDC409. 

LZ-binding partner polypeptide is employed to test its ability to bind host cells expressing 

35 recombinant SVPH-1, SVPH-la, SVPH-lb, SVPH-lc, SVPH-4, SVPH-4a or SVPH-4b polypeptide, as 
discussed above. Cells are cultured in DMEM supplemented with 10% fetal bovine serum, penicillin, 
streptomycin, and glutamine. 48 hours after transfection, cells are detached non-enzymatically and 
incubated with LZ-binding partner (5 mg/ml), a biotinylated anti-LZ monoclonal antibody (5 mg/ml), 
and phycoerythrin-conjugated streptavidin (1:400), before analysis by fluorescence-activated cell 

40 scanning (FACS). The cytometric analysis is conducted on a FACscan (Beckton Dickinson, San Jose, 
CA). 
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The cells expressing LZ-binding partner will show significantly enhanced binding of SVPH-1, 
SVPH-la, SVPH-lb, SVPH-lc, SVPH-4, SVPH-4a or SVPH-4b, compared to the control cells not 
expressing LZ-binding partner. 

The specification is most thoroughly understood in light of the teachings of the references 
cited within the specification which are hereby incorporated by reference. The embodiments within the 
specification provide an illustration of embodiments of the invention and should not be construed to 
limit the scope of the invention. The skilled artisan readily recognizes that many other embodiments 
are encompassed by the invention. 
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